Sri Chavali — CEO
I like to build and experiment with new things. I love coding and the math behind all the complex predictive algorithms/models. Learning by coding is my motto, be it some algorithm or a new programming language. I have Expertise in building high-available, resilient, scalable, low-latency microservices in a distributed architecture and very passionate about database internals, data modeling, data Warehousing, high-performance analytics, big data, ML, Deep learning, and anything related to data:) At Microsoft, I am part of a data platform team. ● Led a team of three to design and implement self-serve reporting frameworks and up-level the data quality for Microsoft 365. ● Architected and implemented a Centralized Ingestion framework and data-aware pre-computation and aggregation that handles hundreds of Terabytes of daily data. ● Solved complex organizational challenges involving disparate data sources by designing a Unified Metrics framework as the single source of truth. Defined the standards and created a process of how Microsoft 365 should onboard new metrics and datasets. ● Developed Autotune using historical data and ML models to predict spark resource usage, which saved Microsoft $2 million. ● Converted the 100 TB daily batch pipeline to streaming and improved the data freshness latency by 20X. Implemented Exactly-once semantics using Flink checkpointing and Delta Lake Atomic commits. Solved complex challenges related to small files by implementing dynamic partitioning and reduced the number of output files from millions to thousands. ● Built and Implemented a two-year roadmap for Microsoft 365 to have the highest quality data. At VMware, I was part of the VROPS team, which collects and processes billions of real-time metrics and time series data daily. ● Led a team of three in managing a complex ecosystem built on Kafka, Flink, and Cassandra, demonstrating leadership skills and proficiency in these technologies. ● Optimized ingestion pipelines resulting in reduction of daily storage from 3TB to 40GB, highlighting efficiency in resource management. Reduced the storage and computing costs by 60%. Saving VMware > $1 million per year. ● Scaled Kafka to ingest billions of messages daily across hundreds of nodes in geo-replicated Active-Active clusters, showcasing ability to handle large-scale data processing. ● Implemented Distributed Profiler on Apache Spark applications leading to a reduction of memory usage by 500GB, demonstrating expertise in performance optimization.
Stackforce AI infers this person is a Backend-heavy Fullstack Engineer specializing in Data Engineering and Analytics.
Location: Hyderabad, Telangana, India
Experience: 15 yrs 1 mo
Skills
- Data Quality Framework
- Data Validation
- Data Ingestion
- Data Quality
- Data Processing
- Api Development
- Microservices
- Performance Optimization
Career Highlights
- Saved Microsoft $2 million with ML resource prediction.
- Converted 100 TB batch pipeline to streaming, improving latency by 20X.
- Optimized ingestion pipelines at VMware, saving over $1 million annually.
Work Experience
Oracle
Consulting Member of Technical Staff (1 yr 6 mos)
Self-Employed
Lead Software Engineer (1 yr)
Microsoft
Senior Software Engineer (2 yrs)
Versa Networks
Software Engineer (1 yr 2 mos)
VMware
Software Engineer (9 yrs 5 mos)
Education
Master's degree at Cleveland State University