Vrishabhdhwaj Maharshi

Software Engineer

Mumbai, Maharashtra, India3 yrs 3 mos experience
Most Likely To Switch

Key Highlights

  • Expert in Big Data Engineering and Data Pipeline Optimization.
  • Proven track record in developing trading strategies and quantitative research.
  • Strong background in Machine Learning and Natural Language Processing.
Stackforce AI infers this person is a Data Engineer with expertise in Fintech and Machine Learning applications.

Contact

Skills

Core Skills

Big Data EngineeringData Pipeline OptimizationData Pipeline DevelopmentGpu ComputingQuantitative ResearchTrading Strategy DevelopmentQuantitative AnalyticsData MiningSurveillance TechnologyMachine Learning EngineeringModel OptimizationMachine LearningNatural Language ProcessingModel DeploymentWeb Development

Other Skills

AWSAWS Kinesis Video StreamsAWS SagemakerAirflowAmazon EKSArduinoAutomatic Text SummarizationAutomationBERT (Language Model)Business Intelligence (BI)C (Programming Language)CUDACascading Style Sheets (CSS)Computer VisionConvolutional Neural Networks (CNN)

About

I am a dedicated professional with a strong academic background and a passion for technology-driven innovation. Holding a B.Tech in ECE and an M.S. in Computational Biology from Jawaharlal Nehru University, I have consistently demonstrated my commitment to excellence. My technical prowess encompasses a wide range of skills. I am proficient in programming languages such as Python, C/C++, Go (Golang). My expertise extends to machine learning tools like Tensorflow, Pytorch, LLMs, Transformers, and BERT, as well as technologies like Docker, AWS, GIT, Jira, and more. In the field of data science, I have honed my skills in Pandas, Numpy, ARIMA, XGBoost, Dask, Spark, and Matplotlib. My professional journey has been marked by valuable experiences. Currently I am a Big Data Engineer at Infinite Analytics. I deal with TBs of data to get insights specific to client requirements. As a Junior Quantitative Researcher at Pace Stock Broking Services Pvt. Ltd., I have taken on the challenge of developing and optimizing trading strategies, along with automating data mining processes. My contributions to low-latency systems and efficient data management reflect my dedication to precision and efficiency. During my tenure as a Machine Learning Intern at Emvirt IoT Edge Solutions, I played a key role in optimizing object detection models for edge devices. My work resulted in significant improvements in accuracy and inference time, highlighting my problem-solving abilities. Additionally, my experience as an NLP Intern at Esya.ai allowed me to delve into natural language processing, intent classification, and text summarization, contributing to a more engaging user experience. In a freelance capacity, I leveraged AWS and Raspberry Pi for Automated Surveillance Systems, showcasing my adaptability and resourcefulness in diverse tech environments. I am equally passionate about research. My projects, such as the analysis of EEG data to study Autism and underwater fish tracking, reflect my commitment to pushing the boundaries of knowledge and innovation. Beyond my technical skills, I have taken on leadership roles, including organizing workshops and excelling in competitions, demonstrating my ability to collaborate and excel in diverse environments. My journey is driven by the pursuit of excellence, a commitment to innovation, and a desire to make a meaningful impact in the world of technology. Let's connect and explore opportunities to collaborate and drive positive change together.

Experience

Infinite analytics

Big Data Engineer

Dec 2023Present · 2 yrs 3 mos · Mumbai, Maharashtra, India · Hybrid

  • Location Data Pipeline: Optimization and architectural modifications to data pipeline ingesting TBs of data using pyspark on OCI/AWS and Kubernetes. Optimized very large dataframe joins by 75% and 85% for India and USA resp. Introduced architectural and procedural changes to pre-existing pipelines, resulting in 100% uptime and reliability. Modified broken data preprocessing stages of the pipeline to improve data write reliability and 60% reduction in execution time. Utilized AWS EMR for quick benchmarking and POCs of alternate pipelines. Contributed in reduction of execution time of all stages of pipeline resulting an improvement of 50% in execution time.
  • Places-of-Interest Data Pipeline: Built a data pipeline from scratch to analyze & generate datasets from proprietary POI data. Optimized dataset generation, for fast joins with Location Data Pipeline. Orchestrated this pipeline using Airflow and managed using Delta format.
  • Very Large Matrix Multiplication: Researched multiplication of very large matrices (100M+ rows) using GPU accelerated computations using CUDA via CuPy, cudf, RAPIDS API for visitation based behavior analysis. Benchmarked various CUDA and lazy-computation libraries. Devised a big data solution by reducing sparse matrices which reduced execution time by 75%.
  • Exploratory Data Analysis: Introduced in-depth criterions of analysis for location data like hourly analysis and teleporting devices.
  • Handled various client requirements with varying complexities. Single-handedly managed analytics for a client with $200M+ revenue to the company.
  • Backend Engineering: Developed and managed the entire backend of the platform using AWS Glue, Presto, Superset, Delta tables. Improved platform performance by 25% by partitioning on query columns.
  • Engineered customer behaviour project using Deep learning via tensorflow & MLOps. Designed REST APIs for the platform.
PySparkCustomer InsightBusiness Intelligence (BI)Microsoft Power BIAmazon EKSSQL+2

Pace stock broking services pvt. ltd.

SDE 1

Dec 2022Dec 2023 · 1 yr · New Delhi, Delhi, India · On-site

  • Developed and tested futures and options trading strategies and analyzed the performance on data of 1080 days using Pandas.
  • Optimized code to reduce runtime by 40%. Used Matplotlib to visualize and plot financial data.
  • Developed pan-India trader database using MongoDB.
  • Automation of data mining from NSE & BSE websites to retrieve useful data with 0.85 second latency.
  • Automated summary generation of Introducers, capital management, and upload data to NSE as Project Manager for developing back-office software.
  • Compiled and analyzed databases with more than 50 million data entries using Big Data Analysis tools like Dask, Apache PySpark.
  • Developed and maintained data-driven solutions for credit management, resulting in improved risk assessment and customer experience.
  • Worked in Machine Learning implementation and supertrend indicator analysis on historical data.
Credit Risk ManagementRabbitMQRedisMicrosoft ExcelTrading SystemsETL+5

Freelance

Freelance Software Engineer

Oct 2022Nov 2022 · 1 mo · Remote

  • Development of an automated surveillance system that utilizes live video streams from raspberry pi and processes them using AWS Kineses Video Streams and AWS Sagemaker.
  • Utilized S3 buckets for storage of surveillance data.
  • AWS SNS was used to notify the user of any unwanted personnel.

Emvirt iot edge

Machine Learning Engineer

Feb 2022May 2022 · 3 mos · Jaipur, Rajasthan, India

  • Optimized ML models and developed machine learning pathways for custom datasets and models (Automated Number Plate Recognition system).
  • Model compression (using quantization and pruning) which reduced inference time from 250ms to 120ms; accuracy 92% in ANPR and 90.4% in YOLOv5.
  • Worked on translating ngrok source code from Golang to Python to transfer and receive docker payloads via tunneling.
  • Deployment of docker images for detection on edge devices (Jetson Nano & Raspberry Pi) with low compute power via SSH tunnel.
  • Streamlined training for a number of datasets.
  • Deployment of Nvidia and custom docker images on edge devices.
  • Engineered a robust pedestrian detection system leveraging TensorFlow, PyTorch and computer vision, enhancing surveillance capabilities on edge devices, drones.
  • Model compression and optimizations using quantization, pruning, Onnx, TensorRT
Go (Programming Language)Python (Programming Language)CUDANVIDIA cuDNNEmbedded SystemsMachine Learning Engineering+1

Esya.ai

Natural Language Processing Engineer

Sep 2021Jan 2022 · 4 mos · Remote

  • Developed a dataset using web scraping and an intent classification model with accuracy 95% for a news recommendation system.
  • Text analysis using Information Retrieval techniques like BM25 and data pipelining using Redis.
  • Deployment of Question answering and Question generation using BERT, ROBERTA from Hugging Face for an interactive user experience.
  • Fine-tuning Large Language Models (LLMs) for a news recommendation website; accuracy improved from 85% to 94%.
Large Language Models (LLM)Large Language Model Operations (LLMOps)Data MiningBERT (Language Model)Automatic Text SummarizationMatplotlib+3

Iha consulting services pvt. ltd.

Web Development Intern

Jun 2021Aug 2021 · 2 mos · Work from home

  • Successfully deployed a Flask-based website with an integrated machine learning model in the backend to automatically predict and display related graphs.
FlaskWeb DesignWeb Development

Medtoureasy

Machine Learning Trainee

Apr 2021May 2021 · 1 mo · India

  • Learned and implemented various libraries and modules of sklearn, Scikit, Numpy, Pandas, Tensorflow, and PySpark.
  • Developed machine learning models for classification, regression, clustering, NLP, Computer Vision, and time-series analysis.

Codespeedy technology private limited

Python programmer

Feb 2021Mar 2021 · 1 mo · India

  • Speech-to-text converter with GUI using Python: The project utilizes the SpeechRecognition module of Python programming language to convert sound or words spoken by the user into text and save it as a text file.
  • RPC implementation using Python: Researched RPC (Remote Procedure Call) and its implementation using Python programming language. Used XML-RPC and gRPC to design and communicate between a server and a client.
  • Graph plotter in Python A Tkinter UI-based Python project to display graphs of the marks of a student.

Education

Jawaharlal Nehru University

Master of Science - MS — Computational Biology

Jan 2022Jan 2023

Jawaharlal Nehru University

Bachelor of Technology - BTech + MTech — Electronics and Communications Engineering

Jan 2018Jan 2023

M.B.M. Engineering College, Jodhpur

B. Tech (First semester) — Electrical Engineering

Jan 2017Jan 2017

Maheshwari Public School

Jan 2005Jan 2017

Stackforce found 80 more professionals with Big Data Engineering & Data Pipeline Optimization

Explore similar profiles based on matching skills and experience