Principal Data Engineer & Open Source Contributor
I'm Shaurya, a Principal Data Engineer with over 15 years of experience designing and operating production data systems at scale. I currently work at a leading fintech, where I architect large-scale streaming and analytics platforms.
I've built data infrastructure for companies like HelloFresh, SAP, Red Bull, and New Work, spanning event-driven architectures, real-time analytics pipelines, and cloud-native data platforms processing millions of events daily.
My toolkit includes Kafka, Spark, Snowflake, BigQuery, Python, Java, Scala, Rust, and cloud platforms (AWS & GCP). I'm particularly into high-throughput streaming systems, data architecture, and writing performant code.
I've contributed to open-source projects including Apache Hudi, Feast Feature Store, and Open Metadata, and have published research on NLP and machine learning.