M

Mohit Arora

Product Engineer

Leeds, England, United Kingdom18 yrs experience

Key Highlights

  • 19+ years of experience in Data and Analytics
  • Expertise in Natural Language Processing and Machine Learning
  • Led legacy data migration initiatives successfully
Stackforce AI infers this person is a Data and Analytics Architect with expertise in Cloud and Machine Learning.

Contact

Skills

Core Skills

Machine LearningGoogle Cloud Platform (gcp)

Other Skills

Apache SparkDB2Data WarehousingInformaticaMicrosoft AzureNLTKNatural Language Processing (NLP)OraclePL/SQLSQLSentiment AnalysisSupervised LearningUnixUnix Shell Scripting

About

Experienced campaigner in Data and Analytics with 19+ years of extensive experience in Natural Language Processing, Analytics, Application Development Framework, and Cloud primarily in GCP, Azure and AWS. Having worked as a Solution Provider to business industries encompassing Media and Broadcast Solution,Banking,Healthcare,Retail and Telecom. Spearheaded legacy data migration initiative from RDBMS to HDFS by maintaining the necessary distributed server configurations . Hands on experience on Python on Databricks along with necessary issue handling exposure to Hive and MongoDB database led to spending good amount of time in projects involving Spark Structured Streaming using Kafka . Devised and communicate top-level objectives and measures of success to guide the teams, challenge them, and hold them accountable Drive some of the company-wide tech initiatives by striving towards continuous technical excellence of our platforms

Experience

Capgemini

Enterprise Data Architect

Jan 2024Jun 2025 · 1 yr 5 mos

  • Exhibited technical leadership, guidance, and alignment to drive innovation, scalability, and efficiency across cross-functional teams. Stay updated on emerging technologies and trends, incorporating them into the organization’s technical strategies and Collaborating with product and delivery teams to define scalable, reusable data architecture assets that align with industry-specific needs

Exl

Assistant Vice President

Jun 2022Jun 2023 · 1 yr

  • Drove the analytics space having experience majorly on GCP suites to gain the insight for the business leaders
  • Implemented the Vision API utility of Machine Learning to extract the necessary information from the .pdf files shared . Helped in detecting labels, face ,object detection, Crop hints and explicit content from the documents
  • Extracted the necessary insights needed to be derived from audio using Speech utility to transcript the data recorded from .wav and .mp3 file
  • Gathered the desired information from video files using video intelligence of ML API
  • Extracting the real time streaming data using PubSub to derive the result into BigQuery
  • Identifying the risk and providing the suggestions for mitigations and working with cross-functional teams to diagnose the issues and help in resolving them in defined time-period
  • Helped in designing the Solution Architect for different customers pertaining to GCP and Azure functionalities for both batch and live streaming data
  • Worked on the transformers package of NLP to detect the Sentiment Analysis. Sentiment Intensity Analyser is used to gather polarity scores. Created plots using matplotlib library.
Microsoft AzureMachine LearningGoogle Cloud Platform (GCP)Sentiment AnalysisApache SparkNatural Language Processing (NLP)

Coforge

Solutions Architect

Sep 2020May 2022 · 1 yr 8 mos

  • Solution Expert involved in contributing the technical strategies for application and systems . Collaborates closely with key leaders to develop strategic projects, plans and processes and streamline decision making. Refactored the pipelines to enable the code run in all the environment successfully
  • Lead by example by being hands on and follow the processes and practices that are developed. Work with and advice stakeholders on technical aspects, make well-informed decisions & function well in a fast-paced, rapidly changing environment.
  • Improved processes, technology and applications by showing the team better ways of doing things and help improve skills in the team. Prioritize tech-debt and ensure the platforms and applications meet the latest industry standards.
  • Hands-on architect/manager to lead the streaming & big data platform team to drive business and take it to the next level. Architected, designed, and led the development of Distributed Middleware, Cloud Platforms, and Big data Systems such as distributed Computing, Distributed File Systems, and Distributed Search Engine Platforms
Microsoft AzureNLTKMachine LearningGoogle Cloud Platform (GCP)

Innovaccer

Engineering Manager

Dec 2019Jul 2020 · 7 mos · Noida, Uttar Pradesh, India

  • Worked as a Technical Manager for their product InNote which cater to the Health care industry by keeping track of the number of patients that come across to different Providers in a day. It is a desktop App that was developed using Django as a backend framework and Electron as a frontend framework that helps in catering to different Providers (doctors) to keep account of their patients that come across with different ailments and their recommended care are tracked using this App which works reasonably well on Windows OS

Core compete

Technical Delivery Manager

Aug 2018Nov 2019 · 1 yr 3 mos · Greater Hyderabad Area

  • Managed the complete architect of handling the ingestion of third-party source related to viewership of Quarter hour, half hour program for 4 streams mainly Live, Live+ Same Day, Live +3 and Live +7 data. Necessary files are available in .mit files which is moved to the GCS bucket and finally made available to the BigQuery tables. Datastore tables are also employed for audit purpose. Pub Sub functionality is implemented to enable the streaming data flow in tables

Itc infotech

Data Architect

Sep 2017Aug 2018 · 11 mos · Bangalore

  • Designed the architecture for extracting data from Sales force and made available via files through third party systems mainly IVYS. Cloud framework is utilized to gather the streams of data and moved to necessary target locations by routing through Spark framework for data quality checks.Necessary data is collected at Hive database which is easily partitioned with required clusters. Resultant data is collected through Microsoft Power BI dashboards by making desired reports which can be easily visualized to different business users
  • Worked on creating Datasets and Pipelines in ADF(Azure Data Factories) by making merge scripts for Hive and finally converting the resultant data into flat files to be made available to Power BI for creating necessary dashboards and reports by storing necessary data in Azure Data Lake store.
  • Implemented scala coding to help in data cleansing using Spark framework by creating .JAR files
  • in order to create Dataframes by making use of Spark SQL
  • Stored the data in Hive External tables by creating ORC format files with necessary partitions and
  • desired buckets to enable Merge operations for necessary Dimensions and Facts
  • Worked on Apache Kafka to gather the stream of JSON file data that is stored into Mongodb
  • database using Kafka Direct streams to be integrated with Spark streaming

Cognizant technology solutions

Senior Associate

Jun 2012Sep 2017 · 5 yrs 3 mos · Bangalore, India

  • Managed organization’s on premise platform to standardize data analytics services that further served upstream clients across multiple locations.
  • Worked on Spark framework by creating spark SQL and storing data into their DataFrames and Datasets an also worked on the scala language performing variances using covariance ,counter variance and invariance analysis on it
  • Gathered the continuous stream of data from Apache Kafka and integrating with Spark streaming via Spark Streaming Receiver in order to produce continuous stream of data using DStream.
  • Developed SMS functionality for notifying on-call support teams regarding job failures by implementing web service consumer transformation in Informatica resulting in efficient utilization of support resources.

Tata consultancy services

IT Analyst

Sep 2010Jun 2012 · 1 yr 9 mos · Gurugram, Haryana, India

  • Designed end-to-end data and analytics architecture for credit management application for relationship managers to achieve actionable insights on improving clientele services
  • Improved system efficiency by 500% by re-architecting ETL process and reducing system overheads, and reducing jobs refresh cycle from 40 minutes to 8 minutes.
  • Analyzed existing ETL processes with flat file source data moving into Oracle data warehouse and enhanced data integrity logic to improve data quality by 33%

Infosys technologies limited

Senior Software Engineer

Jul 2006Sep 2010 · 4 yrs 2 mos

  • Developed the messaging engine to receive alerts from the customer alerts system. The PowerCenter services used MQs to communicate with the messaging engine, and maintain transnational history and retrieval validation logic in Oracle database environment.
  • Created the detailed design and component design documents of the various modules.
  • Partnered with Business Analyst to capture and analyze business requirements
  • Developed Informatica ETL workflows to implement messaging system

Education

Institute of Technology and Management

B.E. — Information Technology

Jan 2002Jan 2006

Modern Vidya Niketan

Jan 1997Jan 2001

Stackforce found 100+ more professionals with Machine Learning & Google Cloud Platform (gcp)

Explore similar profiles based on matching skills and experience