Commit 131a8375 authored by Clarakosi's avatar Clarakosi
Browse files

Initial draft of refactoring efforts

Changes:
* Adds spark udf
* Modifies schema for top_candidates column to now view null
image suggestions as an empty array
* Saves output as parquet in hdfs
parent 7cb80f12
This diff is collapsed.
Supports Markdown
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment