Berlin Buzzwords 2016: Julien Le Dem - Efficient Data formats for Analytics with Parquet and Arrow
Julien Le Dem – Cross-Platform Data Lineage with OpenLineage
Parquet and Arrow - Julien Le Dem
Berlin Buzzwords 2016: Ted Dunning - Fast Cars, Big Data - How Streaming Can Help Formula 1 #bbuzz
Twitter's Julien Le Dem talks about Parquet: open columnar storage for Hadoop
Ten years of building open source standards: From Parquet to Arrow to OpenLineage | Astronomer
Columnar Data: Apache Arrow and Parquet with Julien Le Dem and Jacques Nadeau
Berlin Buzzwords 2011: Doug Cutting - Keynote: Apache Hadoop a new Paradigm #bbuzz
#BDAM: SQL-on-Everything with Apache Drill by Julien Le Dem, Dremio
Apache Parquet: Parquet file internals and inspecting Parquet file structure
Why Apache Arrow is Taking the Open Source Community by Storm
Power a Better Future (sponsored by MemSQL) - Drew Paroski (MemSQL)
Parquet Format at Criteo
Gobblin based distcp (inter cluster copy)
Ted Dunning on Kafka, MapR, Drill, Apache Arrow and More
What's new in Gobblin 0.7
Class 5 - Column Encoding Explained
Install Apache Drill on top of AWS S3
Anshum Gupta - SolrCloud and NoSQL
Out-of-Core Columnar Datasets