PROFILE SUMMARY
Alluxio, spun out of UC Berkeley, brings data closer to compute in on-premise and cloud.
At BGC, I worked as an architect, data engineer and an admin. Produced architectures for low latency systems for brokers to see trades and orders and for regulatory requirements.
On premise to cloud (AWS) migration, develop, automate ETL pipelines on top of S3 Data Lake and reporting
Created ETL pipelines to consume data from multiple sources, transform into harmonized data and produce monthly account statements for retail and corporate customers.
BigData Expertise – Databricks, Alluxio, Spark, SparkSQL, Spark Streaming, Flink, Hive, HDFS, Sqoop, MapReduce, Kerberos, Ranger, Knox, Oozie, Nifi, Kafka, Zeppelin, Apache Iceberg, Delta Lake
undefined