Ayush Pathak

Software Engineer

India10 yrs 7 mos experience

Key Highlights

  • 9+ years of experience in data engineering.
  • Expert in architecting data solutions on AWS.
  • Proven track record of optimizing data processing workflows.
Stackforce AI infers this person is a Data Engineering expert specializing in Big Data and Cloud solutions.

Contact

Skills

Core Skills

Data EngineeringAwsBig DataHadoop DevelopmentBusiness AnalysisWeb Application DevelopmentDatabase Management

Other Skills

Apache AirflowScalaDockerPythonShell scriptingScala SparkPySparkApache SparkAWS EMRAWS GlueRESTful APIsRabbitMQHadoopProject CoordinationCore Java

About

Key-Skills: DataEngineering,Pyspark,Scala Spark,Python,Java,Airflow,AWS-EMR,EC2,S3,ECS,ECR,Databricks,GitlabCICD Education: • B.E. in Computer Science and Engineering, Jabalpur Engineering College (CGPA: 7.4) • Currently pursuing Executive PG Programme in Machine Learning and AI, IIIT-Bangalore Professional Summary: Senior Data Engineer with 9+ years of experience in developing and optimizing large-scale data processing solutions. Currently leading a team at Nielsen, architecting state-of-the-art data fusion processes for the Audience Measurement domain using AWS cloud infrastructure. Key Achievements: • Implemented cutting-edge ETL applications for daily, monthly, and yearly data processes at Nielsen • Optimized data processing workflows at Advantmed, reducing computation time from 20 to 3 hours • Led Hadoop development initiatives and performed business analysis at S.V.N.I.T University Skills: • Programming: Scala, Python, Java, SQL, Shell Scripting • Big Data: Apache Spark, Hadoop, Hive, MapReduce • Cloud: AWS (S3, EMR, EC2, Lambda, ECR, ECS, VPC, Athena, IAM, Glue) • Databases: Oracle 12c, HBase, MySQL • Tools: Docker, Apache Airflow, Databricks, GitLab, DevOps CI/CD Professional Experience: • Senior Data Engineer, Nielsen (Current) • IT Analyst, Tata Consultancy Services • Big Data Engineer, Advantmed India LLP • Senior System Engineer, S.V.N.I.T University • System Engineer, Infosys Passionate about solving complex data challenges and continuously expanding expertise in big data technologies. Skilled in designing robust ETL pipelines and leveraging advanced analytics for business insights. Committed to fostering innovation and mentoring the next generation of data engineers. Seeking opportunities to apply my unique blend of technical expertise, business acumen, and leadership skills in pushing the boundaries of what's possible in the world of data engineering.

Experience

10 yrs 7 mos
Total Experience
2 yrs 1 mo
Average Tenure
2 yrs 1 mo
Current Experience

Nielsen

2 roles

MTS-3

Promoted

Oct 2025Present · 7 mos · On-site

Data EngineeringAWSApache Airflow

Senior Software Engineer

Mar 2024Sep 2025 · 1 yr 6 mos · On-site

  • I spearhead the development and maintenance of ETL (Extract, Transform, Load) pipelines for Nielsen's Audience Measurement domain. My key responsibilities and accomplishments include:
  • Architecting and implementing sophisticated data fusion processes using Apache Airflow on AWS cloud services, ensuring seamless integration and efficient data flow.
  • Developing robust and scalable solutions by leveraging a diverse tech stack, including Scala, Docker, Python, Shell scripting, Scala Spark, and PySpark. This comprehensive approach enables us to handle complex data processing tasks effectively.
  • Optimizing the use of AWS services such as EMR, EC2, Lambda, S3, ECR, ECS, and CloudWatch to enhance data processing efficiency and reliability.
  • Designing and refining ETL applications for various data cycles - daily, monthly, and yearly - to meet evolving business needs and improve overall system performance.
  • Utilizing Databricks to perform advanced analytics and implement machine learning tasks, driving data-driven insights and predictive capabilities.
  • Implementing Agile methodologies for data generation, prediction, and analysis, fostering a flexible and responsive approach to project management.
  • Overseeing version control and continuous integration/continuous deployment (CI/CD) pipelines using GitLab, ensuring streamlined development processes and code quality.
  • This role showcases my ability to lead a team, architect complex data solutions, and leverage cutting-edge technologies to drive business value in the field of audience measurement and analytics.
Apache AirflowAWSScalaDockerPythonShell scripting+3

Tata consultancy services

IT Analyst C2

Jul 2022Feb 2024 · 1 yr 7 mos · Indore, Madhya Pradesh, India · Hybrid

  • In this role, I was part of a dynamic team serving Nielsen, a key TCS client, where I made significant contributions to their data fusion processes. My responsibilities and achievements included:
  • Collaborating on the implementation of sophisticated data fusion workflows using Apache Airflow on AWS cloud infrastructure, enhancing data processing capabilities.
  • Actively contributing to the development of robust solutions utilizing a diverse tech stack, including Scala, Docker, Python, Shell scripting, and Apache Spark (both Scala and PySpark).
  • Leveraging various AWS services such as EMR, EC2, Lambda, S3, ECR, ECS, and CloudWatch to optimize data processing and management workflows.
  • Assisting in the maintenance and improvement of ETL applications for daily, monthly, and yearly data cycles, ensuring smooth and efficient data operations.
  • Utilizing the Databricks platform for advanced analytics and machine learning tasks, contributing to data-driven insights and predictive modeling.
  • Actively participating in Agile methodologies for data processing and analysis, fostering a flexible and responsive approach to project execution.
  • Employing GitLab for version control and CI/CD processes, ensuring streamlined development and deployment practices.
  • This experience at TCS provided me with invaluable insights into large-scale data operations and cloud-based solutions, laying a strong foundation for my future roles in data engineering and analytics.
Apache AirflowAWSScalaDockerPythonShell scripting+3

Advantmed

BigData Engineer

Jul 2020Jul 2022 · 2 yrs · Ahmedabad, Gujarat, India · Remote

  • In this project, I played a key role in developing and implementing an advanced ETL pipeline for US healthcare domain clients, leveraging Big Data technologies. Key accomplishments and responsibilities included:
  • Engineered a comprehensive end-to-end ETL pipeline using PySpark, significantly enhancing data processing capabilities for healthcare clients.
  • Developed sophisticated data extraction processes from frontend UI using RESTful APIs, efficiently mapping to metadata and creating Parquet files for optimized storage and retrieval.
  • Dramatically improved data processing efficiency, reducing computation time from 20 hours to just 3 hours through innovative optimization techniques.
  • Designed and executed robust data validation and transformation processes for both fixed-length and CSV files, utilizing JSON metadata for enhanced flexibility and accuracy.
  • Leveraged AWS services, including EMR clusters and Glue, to facilitate seamless cloud migration and processing, ensuring scalability and performance.
  • Implemented advanced batch processing techniques and integrated RabbitMQ for efficient message queuing, improving overall system throughput.
  • Conducted thorough requirement analysis, development, unit testing, and integrated testing phases, ensuring high-quality deliverables at every stage.
  • Successfully integrated MySQL database and implemented data flow mechanisms to generate accurate ratings for US insurance companies.
  • Collaborated effectively within an Agile sprint model, ensuring timely delivery of project milestones and maintaining high team productivity.
  • Technologies Utilized: PySpark, SparkSQL, Python, RabbitMQ, AWS EMR, AWS Glue, PyCharm IDE, MySQL, RESTful APIs
  • This project showcased my ability to work with cutting-edge technologies in Big Data and cloud computing, while delivering tangible improvements in data processing efficiency and system integration for healthcare clients.
PySparkAWS EMRAWS GluePythonRESTful APIsRabbitMQ+2

S.v.n.i.t group of institute,sagar,m.p

Senior System Engineer

Aug 2017Jul 2020 · 2 yrs 11 mos · Sagar, Madhya Pradesh, India · On-site

  • IT Project Coordinator, S.V.N.I.T Group of Institutes
  • In this pivotal role, I served as the primary liaison between S.V.N.I.T Group of Institutes and external development teams, spearheading a critical WebApp project. My responsibilities and achievements included:
  • Led and coordinated Hadoop development initiatives, significantly enhancing the university's data processing capabilities and paving the way for more efficient information management.
  • Conducted comprehensive business analysis to ensure technology solutions were precisely aligned with the institution's needs, fostering a more effective and streamlined IT ecosystem.
  • Managed complex implementation processes, guaranteeing smooth integration of new systems within the existing infrastructure, minimizing disruption to daily operations.
  • Acted as the central point of communication, effectively coordinating project progress and updates across multiple stakeholders, ensuring all parties remained informed and aligned.
  • Played a key role in the strategic planning and execution of IT initiatives within the university, contributing to the institution's technological advancement and competitiveness.
  • Successfully bridged the gap between technical teams and institutional leadership, translating complex technical concepts into accessible language for decision-makers.
  • Key Skills:
  • Hadoop Development
  • Business Analysis
  • Project Coordination
  • Implementation Management
  • Stakeholder Communication
  • Strategic IT Planning
  • This experience showcased my ability to manage complex IT projects, coordinate between diverse teams, and align technological solutions with institutional goals in an academic setting.
HadoopBusiness AnalysisProject CoordinationHadoop Development

Infosys

System Engineer

May 2015May 2017 · 2 yrs · Pune, Maharashtra, India · On-site

  • Developed and maintained public-facing enterprise web applications for AT&T’s telecom-
  • munication sector
  • Participated in JCart, an e-commerce training project to develop a shopping site module
  • Designed and implemented JSPs, entity classes, and Form Bean classes as per project
  • requirements
  • Created and optimized service classes and DAOs using Hibernate for efficient data
  • management
  • Implemented data validation and minor enhancements to improve application performance
  • Developed product persistence functionality for storing and retrieving product details
  • from databases
  • Created dynamic category-based product displays using dropdown menus
  • Implemented features for product deletion and management within databases
  • Collaborated with cross-functional teams to ensure timely delivery of project milestones
  • Key Contributions: Web Application Development, Database Management, E-commerce
  • Solutions, Team Collaboration, Performance Optimization
Core JavaHibernateJSPWeb Application DevelopmentDatabase Management

Education

Jabalpur Engineering College

Bachelor of Engineering — Computer Science

Aug 2011May 2015

jabalpur engineering college

Jan 2011Jan 2015

International Institute of Information Technology Bangalore

Pursuing Executive PG Programme in Machine Learning and Artificial Intelligence — Machine Learning

Feb 2025May 2026

Stackforce found 100+ more professionals with Data Engineering & Aws

Explore similar profiles based on matching skills and experience