Esha Aishwarya — Operations Associate
As a Data Engineer at Genpact, I specialize in building scalable, optimized, and automated data solutions that enhance efficiency and insights. Starting as a One Data intern and moving into a full-time role, I’ve developed expertise in cloud data engineering, pipeline design, and AI-driven automation.
Currently, I work on a multi-client invoice processing product using Azure Databricks, SQL, SFTP, and Databricks Workflows. I design and optimize pipelines to automate invoice data ingestion, transformation, and validation from multiple sources, ensuring accuracy, traceability, and consistency. The solution follows a Medallion architecture (Bronze–Silver–Gold), with curated SQL views above the Gold layer powering Power BI dashboards for client and financial reporting. I also create Python automation scripts for workflow orchestration, error handling, and monitoring, improving reliability and reducing manual effort.
Earlier, I led a data migration project on Databricks, converting complex PostgreSQL pipelines into Spark SQL, achieving up to 60% performance gains through distributed processing and query optimization. I handled code translation, validation, and benchmarking to ensure seamless migration and reliable delivery. The data model followed the Bronze–Silver–Gold pattern, with Gold-layer views integrated into Tableau for real-time business reporting and insights.
During my internship, I contributed to a Hugging Face project, implementing data lineage tracing and model metadata extraction to improve ML model transparency and governance. This enhanced model usability and collaboration across teams. I’ve also done web scraping using BeautifulSoup and Selenium, extracting insights from Amazon, Flipkart, Nykaa, and YouTube for analytics. Additionally, I built AI-based tools using OpenAI’s GPT-3.5 to automate PDF invoice and Excel reconciliation, improving accuracy and efficiency.
I’m skilled in AWS (S3, Glue, Athena) for ETL workflows and proficient in Excel for analysis and visualization. My strengths lie in data integration, performance tuning, and automation. I’m passionate about Generative AI, NLP, and Machine Learning, continually exploring them through hands-on projects and certifications.
Tech Stack: Spark SQL | Azure Databricks | PostgreSQL | PySpark | SQL | Python | Power BI | Tableau | AWS (S3, Glue, Athena) | SFTP | Databricks Workflows | Pandas | NumPy | Scikit-learn | BeautifulSoup | Selenium | Hugging Face | Git | Excel | OpenAI API
Stackforce AI infers this person is a Data Engineer specializing in SaaS and AI/ML solutions.
Location: Bengaluru, Karnataka, India
Experience: 1 yr 9 mos
Skills
- Data Engineering
- Apache Spark
- Performance Optimization
- Web Scraping
- Data Governance
- Machine Learning
- Data Analysis
- SQL
Career Highlights
- Achieved up to 60% performance gains in a PostgreSQL-to-Spark SQL data migration on Databricks.
- Expert in building automated data solutions using Azure Databricks.
- Passionate about Generative AI and Machine Learning.
Work Experience
Wells Fargo
Data Management Associate (2 mos)
Genpact
Data Engineer (1 yr 7 mos)
One Data Intern (5 mos)
EATCLUB Brands (Formerly BOX8)
Data Analyst Intern (2 mos)
Internshala
Internshala Student Partner (1 mo)
Education
Bachelor of Technology - BTech at Vellore Institute of Technology