T277552 project jdata store as parquet (#10)
* Project instanceof in model output * Upload raw model output to HDFS as paruqet * Add elt to PYTHONPATH when running pytest * Copy raw data to HDFS and convert it to parquet * Update doc * Add instance of to imagerec and store content as parquet * Fix. append to PYTHONPATH * Add placeholder instanceof column in mocks
etl/raw2parquet.py
0 → 100644
etl/schema.py
0 → 100644
Please register or sign in to comment