Ritu Parno Behera

Senior Software Engineer

Bengaluru, Karnataka, India7 yrs 11 mos experience

Key Highlights

6+ years of experience in software development.
Expertise in building scalable data platforms.
Award-winning innovative solutions in data engineering.

Stackforce AI infers this person is a Data Engineer with expertise in building scalable data solutions in the SaaS industry.

Contact

Skills

Core Skills

Data ArchitectureApache SparkApache KafkaAwsHadoopData AnalysisData Engineering

Other Skills

AlgorithmsAmazon Web Services (AWS)Apache AirflowApache Delta LakeApache OozieBig DataBuilding PerformanceCompetitive ProgrammingData ModelsData QualityData ScienceData StructuresDebuggingDeep LearningDeployment Strategies

About

Experienced Software Developer | 6+ Years of Mastery in Crafting Optimal and Innovative Solutions | Acknowledged for Innovative Ideation, Leadership, and Design As a SDE II at Groupon, I work in a dynamic environment and solve real-world problems in a creative manner. I leverage my expertise in AWS, MySQL, and Hadoop to build scalable and reliable data platforms and pipelines that support various business functions and goals. I have a strong passion for data and competitive programming, which I developed during my bachelor's degree in computer science from NIST. I have won multiple awards and honors for my innovative ideas and solutions, such as the Startup Odisha, the Smart India Hackathon, and the Google Kickstart. I also completed several certifications in neural networks, deep learning, and data science to enhance my skills and knowledge. I aspire to continue learning and growing as a data engineer and contribute to the advancement of the field.

Experience

7 yrs 11 mos

Total Experience

1 yr 3 mos

Average Tenure

1 yr 5 mos

Current Experience

Walmart global tech india

Senior Software Engineer

Jan 2025 – Present · 1 yr 5 mos · Bengaluru, Karnataka, India · Hybrid

Groupon

Software Development Engineer II

Sep 2022 – Jan 2025 · 2 yrs 4 mos · Bengaluru, Karnataka, India · Hybrid

1. Designed & Implemented User Analytics for Campaign Optimization
2. Built a Real-time Audience Publication System - For instant Marketing (Email, Push, Social Media, Affiliate, etc)
3. Led a seamless migration from AWS to GCP, optimizing Kafka, Spark Streaming, and reducing costs by $900.
4. Developed a data quality framework, cutting data anomalies by 75% and improving reliability.
5. Re-architected the data pipeline, implementing incremental processing and removing redundant attributes with Apache Delta Lake
6. Cost-saving OLM Policy
7. Implemented an Object Lifecycle Management policy, saving $76,000 annually in GCP storage costs.
8. Integrated Bigtable into the system and revamped user segmentation for better performance and cost efficiency.
9. Migrated a Spark streaming service to GCP, optimising customer-triggered actions.
10. Expert in ensuring system reliability, troubleshooting, and resolving performance bottlenecks.

HiveOrchestrationTeam ProductivityApache AirflowHadoopAlgorithms+19

Flipkart

Senior Data Engineer

Feb 2022 – Sep 2022 · 7 mos · Bangalore Urban, Karnataka, India

HiveOrchestrationData AnalysisTeam ProductivityApache AirflowHadoop+17

Paytm

Data Platform Engineer

Jan 2021 – Feb 2022 · 1 yr 1 mo

1. Developed a centralized data catalog for compliance within Hadoop, integrating it with enterprise data
tools.
2. Designed and implemented OLAP systems to enhance data analysis and reporting.
3. Led EMR cluster migration (v5.3.0 to v6.3.0) and Spark upgrade (v2.4.7 to v3.1.0) for improved
performance.
4. Created an automated S3 Cleaner tool to optimize data storage and cost management.
5. Conducted POCs with various technologies, informing strategic decisions.
6. Optimized Spark performance for efficient data processing.
7. Built a data-builder microservice for seamless metadata updates.
8. Introduced a data-steward microservice for manual metadata control.
9. Integrated AWS Deequ for enhanced data quality assurance.
10. Implemented data anomaly detection feature using Deequ metrics.
11. Proficiently used Docker for production-ready Python applications.
12. Identified and reported a critical bug during EMR migration.

HiveOrchestrationTeam ProductivityApache AirflowHadoopAlgorithms+20

Goldman sachs

Contigent Worker(Data Engineering)

Feb 2020 – Jan 2021 · 11 mos · Bengaluru, Karnataka, India

Responsibilities
1. Built ETL applications that calculate accurate brokerage.
2. Restructured existing design to improve performance significantly.
3. Built stream processing application for real time processing.
4. Handled Data Parsing, Cleansing, Quality definitions, Data Pipeline.
5. Built and conceptualized data flow architecture with NiFi.
6. Managed NiFi Administration setting-up:
Zero master cluster
Security Configurations
Authentication/Multitenant Authorization & securing connection with SSL
Build and developed complex NiFi workflows.
Improved Performance by using Load Balancing concept.
Impact
Highly scalable data flow which facilitates flow of data across variety of software system with very low latency with distributed computation in near real time.
Impact
1. Ceased leakage of unnecessary brokerage.
2. Increased Spark Application performance up-to 60%.
Technologies worked on:
Java, Scala, Python, Apache Spark, Hive, Hbase, NiFi, Kafka, Gitlab, AWS (S3, Lambda, EMR, RDS, VPC, Security Group, Elastic IP), Drop-wizard, Django, SBT/Maven, Postman Client, Agile/Waterfall Model, Junit, Cucumber

HiveOrchestrationData AnalysisTeam ProductivityHadoopAlgorithms+22

Mindtree

Data Engineer

Jul 2018 – Feb 2020 · 1 yr 7 mos · Bangalore

Client: JW Marriott
Involved in a migration project where the revenue management system was being migrated to modern technology using Big Data tools like- Spark, Kafka, NiFi, Oozie.
Responsibilities
Designed ETL pipeline, automated data flow across software systems.
Built complex workflows to schedule and orchestrate Hadoop Jobs.
Developed applications to process data on distributed Systems.
Was Responsible for creating, storing, aggregating, transforming, presenting structured and unstructured data and deploying Hadoop Jobs to
Cloud.
Automated Hadoop Job’s report generation, error-prone manual data comparison process and saved time for other issues.
With strong data modelling skills and data analytic skills was able to predict accurate hotel room booking price using Apache PySpark.'
Technologies worked on:
Java, Scala, Python, Microsoft SQL, Oracle, MySQL, PostgreSQL, - Apache Spark, Oozie, NiFi, Kafka, HBase, Hive, PySpark.