Rituraj Kumar

Data Engineer

Tallinn, Harjumaa, Estonia · 8 yrs 1 mo experience
AI ML Practitioner · AI Enabled

Key Highlights

  • Led development of scalable data analytics platform.
  • Achieved 85-90% cost reduction in data processing.
  • Implemented real-time streaming for enhanced user engagement.
Stackforce AI infers this person is a Data Engineering and MLOps expert in Fintech and SaaS industries.

Skills

Core Skills

Data Engineering · Data Warehousing · MLOps

Other Skills

AI models · AWS RDS · Active Listening · Airflow · Amazon DynamoDB · Amazon SageMaker · Amazon Web Services (AWS) · Analytical Skills · Apache Spark · Apigee API Management · Asset Management · Big Data · BigQuery · Bitbucket · Business Intelligence (BI)

About

Accomplished Data and MLOps Engineer with over 7 years of experience driving innovative data solutions for large enterprises and leading startups. Specializing in MLOps and DataOps, I have a proven track record of spearheading large-scale projects that integrate advanced analytics and machine learning to drive substantial business transformations. In my role at Zeals, I orchestrated the design and implementation of a recommendation engine and offer tracking system, significantly enhancing chatbot interactions and user engagement. I also led the development of a scalable data analytics platform that integrates multiple sources for real-time insights, achieving a 10x improvement in processing speed and reducing infrastructure costs by 85-90%. Additionally, I championed the implementation of end-to-end training and prediction pipelines using MLOps principles, ensuring efficient deployment and maintenance of machine learning models. My industry experience spans fintech, consumer products, and retail, where I am passionate about delivering data-driven strategies that empower management teams with actionable insights and drive strategic advancements.

Key strengths:

  • ✔️ Strategic planning and leadership in data engineering and MLOps
  • ✔️ Building data warehouses using modern cloud platforms and technologies
  • ✔️ Creating and automating data pipelines, real-time streaming & ETL processes
  • ✔️ Proficiency in creating intuitive dashboards and implementing machine learning models
  • ✔️ Skilled in data cleaning, processing and data migration
  • ✔️ Data strategy advisory & technology selection/recommendation

Technologies I frequently work with:

  • ☁️ Cloud Platforms: GCP, AWS, Azure
  • 👨‍💻 Databases: BigQuery, Redshift, Snowflake, RDS, PostgreSQL, MySQL, S3, MongoDB
  • ⚙️ Data Engineering Tools: BigQuery, Airflow, dbt, Hadoop, Snowflake, GCP Dataflow, ETL, Data Lake, Data Warehouse, Data Quality, Data Governance, Data Security
  • 🤖 MLOps Tools: Kubeflow, Kubeflow Pipelines, Vertex AI, SageMaker, Vertex Pipelines
  • 📊 Data Visualization: Tableau, Looker Studio (Google Data Studio), Power BI
  • 🚀 Containerization and Orchestration: Docker, Kubernetes
  • 🔄 Streaming and Search: Kafka, Elasticsearch, Real-Time Streaming, Flink
  • 💻 Programming Languages: Python, SQL, PySpark
  • 🛠️ Frameworks and Libraries: Flask, Dagster, Terraform
  • 🗂️ Version Control and CI/CD: Git, GitHub Actions, GitLab
  • 🔧 Other Technologies: NoSQL, ComfyUI, Stable Diffusion, GCP Vertex Pipelines, BigQuery ML, gRPC
  • 👥 Leadership and Project Management: Team management, Project Coordination

Experience

Total Experience: 8 yrs 1 mo
Average Tenure: 3 yrs 4 mos
Current Experience: 1 yr 4 mos

Nabuminds

DataOps Engineer

Feb 2025 – Present · 1 yr 4 mos · Tallinn, Harjumaa, Estonia · Hybrid

Tifin ag

Senior Data Engineer

Sep 2024 – Dec 2024 · 3 mos · Bengaluru, Karnataka, India · Hybrid

  • ✔️ Designed and implemented the Data Warehouse architecture from the ground up for financial asset management use cases, supporting data scalability and performance for 100+ million records across multiple datasets.
  • ✔️ Developed and integrated a data loader layer to ingest data from 2+ diverse data sources into Snowflake’s raw layer, reducing data ingestion time by 30%.
  • ✔️ Created and maintained 20+ dbt models for transforming data across layers (raw, staging, RDWH) and building 10+ customer-specific business data marts, enabling faster and more accurate reporting.
  • ✔️ Collaborated with the CFO and VP to align data solutions with business goals, driving product scalability and increasing customer adoption through improved analytics capabilities.
Data Warehouse architecture · Snowflake · Data loader layer · dbt models · Data transformation · Collaboration with CFO · +2

Zeals co., ltd.

Senior Data & MLOps Engineer

Apr 2021 – Jun 2024 · 3 yrs 2 mos · Tokyo, Tokyo, Japan · Remote

  • ✔️ Designed and developed a Data Analytics Platform to analyze user data for personalized interactions and campaign optimization.
  • ✔️ Engineered a robust data pipeline using dbt, GCP BigQuery, and Airflow to process and analyze TBs of data weekly, improving data retrieval times by 50% and enabling more accurate predictive modeling.
  • ✔️ Optimized the data processing pipeline, achieving an 85-90% reduction in costs and processing time, and a 10x speed increase.
  • ✔️ Developed a Vertex AI Pipelines-based production pipeline for ML use cases, enhancing training efficiency by 50% and availability by 40%.
  • ✔️ Developed and optimized scalable data marts using ETL pipelines in Python, SQL, and Spark, reducing query times by 35% over a period of 12 months.
  • ✔️ Implemented real-time streaming pipelines using PySpark for a recommendation engine, leading to a 20% increase in user engagement and a 15% boost in conversion rates by delivering personalized offers in real-time.
Data Analytics Platform · dbt · GCP BigQuery · Airflow · Vertex AI Pipelines · Real-time streaming · +2

Quantiphi

3 roles

Senior Data & Platform Engineer

Promoted

Jan 2020 – Apr 2021 · 1 yr 3 mos

  • ✔️ Developed and deployed a healthcare analytics platform and data warehouse on GCP Cloud, utilizing BigQuery for data storage and analytics, and Dataflow for efficient data processing.
  • ✔️ Achieved a 60% improvement in data accessibility and reduced processing time by 70% through optimized data pipelines.
  • ✔️ Used Airflow for workflow management and dbt for data transformation on GCP, and provided actionable insights to healthcare professionals, resulting in a 30% improvement in the delivery efficiency of business KPIs.
  • ✔️ Collaborated with data scientists to enhance AI models for predictive analytics in patient outcomes and personalized treatments.
  • ✔️ Integrated AI models into production using GCP AI Platform and TensorFlow, leveraging Vertex Pipelines for machine learning operations, which improved prediction accuracy by 20%. Used Federated Learning for cross-country data analysis and model training to ensure data sensitivity and governance, utilizing Nvidia Clara for enhanced data security and compliance.
Healthcare analytics platform · GCP Cloud · BigQuery · Dataflow · AI models · TensorFlow · +2

Data Engineer

Sep 2018 – Jan 2020 · 1 yr 4 mos

  • ✔️ Enhanced business insights through advanced data engineering techniques, ensuring robust data management and real-time insights delivery for marketing analytics use cases, leading to a 20% increase in sales efficiency and more effective targeting by sales teams.
  • ✔️ Created a scalable GCP data warehouse, optimizing data accessibility and analytics capabilities for managing large datasets effectively.
  • ✔️ Optimized data processing and analysis workflows with GCP services and PySpark, improving operational efficiency and decision-making.
  • ✔️ Implemented CRM solutions integrating GA360, Salesforce Marketing Cloud, and other data sources to enhance user profiling and targeted marketing strategies, resulting in a 15% increase in conversion rates.
  • ✔️ Contributed to enhanced customer satisfaction by leveraging data-driven strategies and technologies effectively in business operations.
GCP data warehouse · PySpark · CRM solutions · GA360 · Salesforce Marketing Cloud · Data Engineering

Platform Engineer

Sep 2017 – Sep 2018 · 1 yr

  • ✔️ Designed and implemented a GCP-hosted microservices platform for Speech and Recognition analytics, ensuring secure and efficient resource access.
  • ✔️ Collaborated with a Data Scientist to improve speech recognition accuracy, driving business growth and customer satisfaction.
  • ✔️ Developed backend services integrating analytics KPIs, such as user interaction metrics and speech analytics use cases, providing useful data features for enhancing model performance and operational efficiency, resulting in a 25% increase in speech recognition accuracy.
  • ✔️ Deployed data workflows and pipelines on GCP, reducing data processing time by 30% and accelerating model training cycles.
  • ✔️ Demonstrated expertise in data engineering, MLOps, and cloud infrastructure to deliver impactful solutions aligned with business objectives.
GCP-hosted microservices platform · Speech and Recognition analytics · Data workflows · Model training cycles · Data Engineering · MLOps

Mobivend logistics solutions pvt ltd

Intern

May 2016 – Jun 2016 · 1 mo · Bangalore, India

  • Developed a working model of a kiosk machine using Arduino and the supporting hardware, along with open-source tools such as C, PHP, MySQL, and the PHPWord library.

Education

Vellore Institute of Technology

Bachelor of Technology (B.Tech.) — Information Technology

Jan 2013 – Jan 2017

Don Bosco Academy

ISC-12TH — Science with Economics

Jan 2011 – Jan 2013

Don Bosco Academy

ICSE-10TH

Jan 2007 – Jan 2011
