Preetam Soni

Data Engineer

Mumbai, Maharashtra, India · 5 yrs 2 mos experience
AI Enabled · AI ML Practitioner

Key Highlights

  • Over 3 years of Big Data experience.
  • Certified in AWS, Azure, and Databricks.
  • Expert in building robust data pipelines.
Stackforce AI infers this person is a Data Engineering and Data Science professional specializing in cloud-based solutions.

Skills

Core Skills

Data Engineering · Machine Learning · Cloud Computing

Other Skills

AWS · Apache Spark · Artificial Intelligence (AI) · Athena · Azure · Business Development · C · CI/CD · CloudWatch · Customer Relationship Management (CRM) · Customer Service · Data Analytics · Data Science · Diamond Grading · Diamond Jewellery

About

Title: Passionate Data Professional | AWS & Azure and Databricks Certified | Extracting Insights from the Depths of Data 🚀

As a Data Engineer turned Data Scientist, I found inspiration in the immeasurable value of data. With over 3 years of Big Data experience, I'm skilled in Python, SQL, PySpark, AWS, Power BI, Snowflake, and Talend. I have successfully delivered POCs and contributed to cloud migration projects at Wavicle Data Solutions.

My data engineering toolkit includes SQL, PySpark, AWS (S3, Redshift), Power BI, Python, Glue, Lambda, QuickSight, CloudWatch, RDS, SageMaker, and more. I have secondary proficiency in PostgreSQL, MongoDB, DynamoDB, and Azure. I am also proficient in ML techniques such as Regression, Random Forest, SVM, and NLP.

Notable projects:
  • Optimizing Time Series Demand Forecasting: Leveraging Scenario Analyzer for Enhanced Accuracy (AWS, SageMaker, Snowflake, Redshift, S3, Python, PySpark, QuickSight, Glue)
  • Automating Talend to Glue Conversion: Seamless Transformation of Talend Jobs to Glue Jobs with PEP8 Standard Compliance
  • People churn modeling, Tic-Tac-Toe agents using Q-learning, Twitter sentiment analysis, Data lineage network graph development

Certifications:
  1. Databricks Certified Associate Developer for Apache Spark 3.0
  2. AWS Certified Cloud Practitioner
  3. Microsoft Certified Azure AI Fundamentals
  4. Talend Data Integration Developer Practitioner badge
  5. Snowflake SnowPro Core Udemy

Open to data-related opportunities and collaborations: pritam.soni4949@gmail.com | psoni4@unh.newhaven.edu 📩

Driven by the boundless potential of data, I'm eager to discover transformative insights on my continuous data journey. #DataProfessional #AWS #Talend #DataScience #ML #DataInsights

Experience

Total Experience: 5 yrs 2 mos
Average Tenure: 1 yr 8 mos
Current Experience: --

Wavicle Data Solutions

3 roles

Data Engineer

Promoted

Jun 2023 - Sep 2024 · 1 yr 3 mos · Hybrid

  • As an experienced Data Engineer, I thrive on building robust pipelines in AWS using an array of powerful tools: Lambda, Glue, Redshift, Sagemaker, Step Functions, S3, and RDS. Handling diverse client requirements is my forte, as I carefully understand their needs, provide valuable feedback, and efficiently implement the work.
  • Monitoring data integrity and ensuring smooth operations, I take pride in creating efficient CI/CD pipelines with Jenkins, streamlining development and deployment processes.
  • Additionally, I implement ML models such as demand forecasting using SageMaker, and I integrate MLOps orchestration with Snowflake so that machine learning solutions fit into the broader data ecosystem.
  • With a passion for data-driven innovation, I'm committed to delivering excellence in every aspect of data engineering, enabling businesses to harness the true power of their data. 🚀 #DataEngineer #AWS #DataPipelines #MLOps #Sagemaker #Snowflake #Jenkins #DemandForecasting
AWS · Lambda · Glue · Redshift · Sagemaker · Step Functions · +6

Associate Data Engineer

Jun 2022 - Jun 2023 · 1 yr · Hybrid

  • Developed the data pipeline architecture and linked it to the UI, so that any file submitted from the UI loads directly to S3, is processed in Glue, is loaded to Redshift, and is analyzed in QuickSight.
  • Collaborated with the Chief Architect in the ideation and design of a new backend data pipeline, providing key insights and justifications for the selection of resources, and assisted in the development of the pipeline from scratch.
  • Implemented data pipeline automation using event-based triggers with no manual intervention.
  • Communicated with project managers and analysts about data pipelines that improved efficiency.
  • Worked with supervisors to understand business requirements and translate those requirements into actionable reports.
  • Developed and implemented a job scheduling solution that dynamically adjusts to user-provided parameters, resulting in significant cost savings by reducing unnecessary Lambda and other resource usage, while optimizing compute power usage.
  • Deployed AWS platform version control to effectively manage and track changes to the organization's cloud infrastructure, ensuring reliability and streamlined collaboration among team members.
  • Developed and implemented efficient and scalable database schemas, including designing and optimizing tables, to support data storage and processing requirements.
  • Designed and developed an AWS-based backend system that automatically updates information and integrates with Microsoft Forms using SES, enhancing data accuracy and streamlining communication processes.
  • Primary: SQL, PySpark, AWS, S3, Hadoop, Hive, Redshift, Power BI, Python, Glue, Lambda, QuickSight, CloudWatch, EventBridge, SES, Athena, AWS RDS, Snowflake, Talend, MySQL, Microsoft SQL Server, SQLite
  • Secondary: PostgreSQL, MongoDB, DynamoDB, Azure
SQL · PySpark · AWS · S3 · Hadoop · Hive · +20

Data Engineering Intern

Jan 2022 - Apr 2022 · 3 mos · Chicago, Illinois, United States · Remote

  • Successfully collected, processed, and loaded data to AWS RDS, which was then utilized to deduce associations between source and destination columns.
  • Implemented and analyzed several approaches, and demonstrated knowledge of Python libraries and PyPI packages.
  • Queried the database to extract all metadata and derive table-to-column relations for source and target in SQL scripts.
  • Contributed to UI design and development, demonstrating teamwork, quick learning, and an understanding of technologies such as JavaScript, React, Pyvis, and Node.js.
  • Collaborated with the UI team to incorporate data lineage backend programming.
  • Demonstrated data storytelling through extraction of data insights from different databases/tables.
AWS · ETL · Data Engineering · SQL · Python

University of New Haven

2 roles

Math Zone Learning Assistant (Non-Grading Teaching Assistant)

Aug 2021 - Jan 2022 · 5 mos

  • Preetam specializes in Statistics, Linear Algebra, Trigonometry, Probability, and Geometry.
  • Job Responsibilities:
  • Assisting Professors in five classes with daily lectures.
  • Providing student support to 200 students through Zoom appointments and drop-in Office Hours.
  • Analyzing data from student appointments to enhance teaching experiences.
  • Collaborating with Professors on YouTube videos for passive study.
  • Innovatively presenting topics using online resources like YouTube, Wikipedia, and other online platforms.

Communication Intern

Aug 2021 - Jan 2022 · 5 mos

  • Managed all social media accounts (LinkedIn, Facebook, Instagram) and the website.
  • Analyzed the audience, created content, researched keywords, performed search engine optimization using meta keywords, and sent a daily report to the director of the institute. Held weekly meetups with all management authorities and a weekly status review.

Book My Task Services LLP

Data Engineer

Feb 2018 - Aug 2020 · 2 yrs 6 mos · Mumbai, Maharashtra, India

  • Hands-on experience with marketing and digital tools such as Google AdWords, social ads, social analytics, and CRM.
  • Hands-on experience with social media platforms (data extraction, social analytics).
  • Used Google Analytics and other analytics software for data mining and analysis.
  • Revamped a business page on Facebook, which grew to over 35,000 followers (up 40% in 2 months) and led to a 3% increase in revenue.
  • Built reports in MS Excel and used Tableau to create dashboards.
  • Conducted exploratory analysis on the data and provided actionable insights.
  • Analysed zip-code trends to spot areas with high demand for cleaning services, and cross-sold the cleaning service with the annual repair and maintenance package using SEM/SMM/PPC social media techniques, which helped increase revenue by 3% quarterly.
  • Spotted high-demand days and trends, and helped adjust pricing to business needs.
  • Analysed zip codes with high service demand and increased labour availability in those areas on the identified high-demand days.
  • Worked with Spark, Hadoop, MapReduce, AWS S3, EMR, and other big data frameworks.

Education

University of New Haven

Master of Science in Data Science

Jan 2021 - May 2022

JAIN College

Bachelor’s Degree

Jan 2011 - Jan 2015
