Paras Varshney — AI Researcher

As a seasoned Data Scientist and Machine Learning Engineer, I specialize in leveraging artificial intelligence to unravel complex patterns and predict future trends. My academic journey includes a Master's in Data Analytics Engineering from Northeastern University, where I honed my skills in machine learning and predictive analytics. I am adept at analyzing vast datasets, developing robust algorithms, and implementing deep learning workflows, with a proven track record of exceeding accuracy targets. My practical experience extends across various domains, particularly in the application of AI and ML to environmental conservation, with a focus on underwater mammal research. I have successfully developed machine learning models for classifying underwater sounds with impressive accuracy and precision, demonstrating my ability to apply data science principles to real-world challenges. I am proficient in programming languages such as Python, R, SQL, and Java, with a deep understanding of big data tools like Hadoop, Apache Spark, and cloud platforms like AWS and GCP. My expertise in natural language processing, statistical modeling, and data visualization complements my proficiency in advanced machine learning frameworks like TensorFlow and Scikit-Learn. My commitment to continuous learning is reflected in my active participation as a Global Data Science Ambassador, where I contribute to organizing and executing data science competitions on a global scale. This leadership role has not only allowed me to network with peers and industry leaders but also positioned me as a thought leader in the data science community. With a Bachelor's degree in Computer Engineering and a Master tier achievement on Kaggle, I bring a blend of theoretical knowledge and practical experience to the table. My collaborative approach and dedication to innovation make me a valuable asset to any team seeking to harness the power of data to drive strategic decision-making and operational excellence. Languages: Python, SQL, Java, R, C++, Rust, Git, Matlab, MySQL, NoSQL, Spark, Unix, bash/shell scripting Applied CS: Data Science, Machine Learning, Deep Learning, Big Data, Full Stack, Computer Engineering, Data Engineering, Design Patterns, Computer Vision, NLP Tools and Frameworks: Docker, Kubernetes, Business Intelligence, Maven, UI, RESTful API, Web Services, Hibernate, GCP, PyTorch, MLOps, DevOps, GitHub, Jira, Confluence, Snowflake, Data Lake, Kafka, Map Reduce, Hadoop, Hive, ETL (Pandas, NumPy, scikit-learn, Keras, PyTorch, TensorFlow, OpenCV, Matplotlib, Seaborn)

Stackforce AI infers this person is a Data Science expert with a focus on Environmental Conservation and Fintech.

Location: Boston, Massachusetts, United States

Experience: 6 yrs 7 mos

Skills

Data Science
Machine Learning
Leadership
Creative Writing
Android Development

Career Highlights

Expert in machine learning and predictive analytics.
Proven track record in environmental conservation projects.
Active leader in global data science initiatives.

Work Experience

Fidelity Investments

Senior Data Scientist (1 yr 11 mos)

Beyond Limits

Data Science Intern (2 mos)

Northeastern University

Research Assistant (1 yr 4 mos)

LogicAI

Data Scientist (11 mos)

Indian Institute of Science (IISc)

Data Scientist (1 yr)

HP

Global Data Science Ambassador (3 yrs 2 mos)

OneLXP by illumnus®

Data Science Engineer (3 mos)

Medium

Technical Writer (1 yr)

OneLXP by illumnus®

Android Developer Intern - AI (6 mos)

Magicbricks

Android Developer Intern (2 mos)

Education

Master's degree at Northeastern University

Bachelor's at Indian Institute of Information Technology Design & Manufacturing, Kurnool

Paras Varshney

AI Researcher

Boston, Massachusetts, United States6 yrs 7 mos experience

Most Likely To SwitchAI Enabled

Key Highlights

Expert in machine learning and predictive analytics.
Proven track record in environmental conservation projects.
Active leader in global data science initiatives.

Stackforce AI infers this person is a Data Science expert with a focus on Environmental Conservation and Fintech.

Contact

paras.varshney97@gmail.com LinkedIn

Skills

Core Skills

Data ScienceMachine LearningLeadershipCreative WritingAndroid Development

Other Skills

AIAcousticsAnalyticsArchitectural DesignArtificial Intelligence (AI)Audio EngineeringAzure Data LakeAzure DatabricksAzure DevOps ServicesAzure SQLBusiness AnalysisBusiness Intelligence (BI)C++CampaignsCloud Security

About

Experience

6 yrs 7 mos

Total Experience

1 yr 3 mos

Average Tenure

1 yr 11 mos

Current Experience

Fidelity investments

Senior Data Scientist

Jun 2024 – Present · 1 yr 11 mos · Boston, Massachusetts, United States · Hybrid

Building and managing cutting-edge fraud risk technologies utilizing machine learning models. These models analyze vast datasets – internal customer behavior and external threat intelligence – to identify and prevent diverse financial scams. Statistical optimization continuously refines these models, ensuring they stay ahead of evolving fraud tactics.
Our responsibility extends beyond individual accounts. We oversee cyber fraud capabilities across all Fidelity business lines, encompassing Wealth Management, Retirement plans, Digital Assets, and more. This holistic approach guarantees a secure environment for your financial information, regardless of your investment choices.
Our data science team fosters a collaborative environment. We deploy efficient machine learning workflows, even on resource-constrained platforms. By performing feature engineering to enhance existing systems, we leverage signals from supervised and unsupervised learning methods (like anomaly detection) to identify suspicious activity. Through collaboration with other teams, we provide data science support for all fraud-related issues at Fidelity.

Machine LearningStatistical OptimizationData AnalysisFeature EngineeringData Science

Beyond limits

Data Science Intern

Jun 2023 – Aug 2023 · 2 mos · Los Angeles, California, United States · Hybrid

1. I spearheaded a scientific research project focused on improving invoice processing efficiency by leveraging Form Recognizers. Through meticulous experimentation and analysis, I successfully implemented real-time account code suggestions, resulting in a remarkable 71% reduction in processing time compared to manual coding. This achievement, facilitated by my expertise in product mapping and artificial intelligence, revolutionized our invoice processing workflows, enhancing overall productivity and accuracy. Additionally, I worked on a GUI based on Streamlit for real-time analysis of natural language processing data, enabling deeper insights into vendor and client behavior and further contributing to the 71% processing time savings.
2. Additionally, I played a pivotal role in reducing prediction error margins by 0.6x through the strategic application of natural language processing (NLP) strategies and hierarchical classification programming. Leveraging my proficiency in data science methodologies, I implemented innovative approaches to enhance model observability and accuracy, ultimately optimizing our predictive analytics capabilities.
3. Furthermore, I led the implementation of a streamlined ERP invoice processing system, which significantly improved accuracy and efficiency in account code suggestion while eliminating the need for manual Chart of Accounts (CoA) lookup. Through meticulous documentation and collaboration with cross-functional teams, I ensured seamless integration of this system, enhancing data processing pipelines and driving operational excellence within the organization. Additionally, my efforts in optimizing the invoice data pipeline, utilizing technologies such as Oracle databases and AWS S3 infrastructure, resulted in a substantial 71% reduction in processing time, further streamlining operations, and improving overall efficiency.

Invoice ProcessingNatural Language ProcessingStreamlitData Pipeline OptimizationData ScienceMachine Learning

Northeastern university

Research Assistant

Feb 2023 – Jun 2024 · 1 yr 4 mos · Boston, Massachusetts, United States · On-site

1. As a Research Assistant at Northeastern University, I developed and implemented a cutting-edge machine learning solution for analyzing underwater mammal sounds across various frequency bands. Utilizing spectrogram analysis and signal processing, I achieved 95% accuracy, reducing clutter in 20 million sound samples. Expertise in PCA, t-SNE, and audio recognition enhanced results.
2. Additionally, I played a pivotal role in crafting a sophisticated geospatial localization map tailored for real-time tracking of underwater mammals. Leveraging my expertise in data analytics and machine learning, I meticulously engineered a comprehensive mapping tool that seamlessly integrated data from acoustic and GPS sensors. By employing advanced statistical analysis techniques and machine learning algorithms, I ensured the delivery of precise monitoring capabilities essential for extracting advanced research insights. This collaborative effort with esteemed researchers from MIT not only showcased my proficiency in data science methodologies but also demonstrated my ability to translate complex technical concepts into practical, impactful solutions that drive scientific exploration forward.
3. Assisted in redesigning an acoustic array for underwater mammal sound data analysis, implementing state-of-the-art machine learning solution for clustering species, and achieving a silhouette coefficient of 0.983. Developed a real-time Streamlit GUI for multi-processing analytics on 200M samples for 4k–500k frequency bands. Collaborated closely with researchers from MIT for data acquisition at the Gulf of Mexico expedition. This resulted in securing $4 million in funding for building a v2 with 6 fold subsection array.

Machine LearningSignal ProcessingStatistical AnalysisGeospatial MappingData Science

Logicai

Data Scientist

Sep 2021 – Aug 2022 · 11 mos · Warsaw, Mazowieckie, Poland

Mentored teams in the "KaggleDays x Z by HP Global Championship," showcasing leadership and decision-making skills within an agile team environment. This mentorship led to a notable 30% increase in participant engagement, translating into enhanced collaboration and knowledge sharing within the data science community. Additionally, by orchestrating 13 Kaggle competitions, we expanded the reach and impact of our initiatives, further solidifying LogicAI's position as a leader in data science innovation. Through these initiatives, we not only fostered a culture of continuous learning and improvement but also established valuable connections within the industry, paving the way for future collaborations and advancements in data science research.

MentorshipKaggle CompetitionsLeadershipData Science

Indian institute of science (iisc)

Data Scientist

Sep 2020 – Sep 2021 · 1 yr · Bengaluru, Karnataka, India

1. Engineered a dynamic, stateless retraining strategy, enhancing model adaptability across real-time transportation datasets. This strategy not only boosted model performance but also optimized the MLOps architecture, resulting in an estimated 20% reduction in operational costs. By building a robust data warehouse catalog, we streamlined data access and management, further amplifying cost savings and enabling scalability for future growth.
2. Collaborated closely with backend engineers to develop an ML platform for training and inference, integrating CI/CD pipelines for seamless deployment. Through the optimization of transport data governance using Databricks, we achieved a notable 20% reduction in congestion, translating into substantial cost savings for transportation agencies. Additionally, by enhancing MLOps efficiency by 15% and accelerating the ITMS forecasting pipeline by 1.5 times, we effectively expanded the project scope to handle larger datasets and more complex forecasting scenarios, ultimately driving additional value for stakeholders.
3. Spearheaded the development of an open-source ITMS forecasting data ingestion pipeline, processing 900k data points per minute from 5 smart cities. This initiative resulted in a remarkable 1.6x increase in real-time throughput, enabling city administrators to make more informed decisions with timely insights. The cost savings associated with this increased efficiency amounted to an estimated $X annually, demonstrating the tangible impact of our work on the bottom line.

Model AdaptabilityData GovernanceMLOpsData ScienceMachine Learning

Hp

Global Data Science Ambassador

Aug 2020 – Oct 2023 · 3 yrs 2 mos · Denver, Colorado, United States · Remote

Spearheading as one of the 16 Global Data Science Ambassadors for HP, I strategically integrate cutting-edge hardware and software solutions to drive transformative outcomes in data science workflows. Through my extensive experience and expertise, I delved into collaborations with industry leaders like NVIDIA on building cutting-edge GPU technology that can benefit data scientists and researchers, optimizing data processing tasks with advanced hardware accelerators, and ensuring peak performance in data-driven initiatives.
As a distinguished member of an elite cohort of Global Data Science Ambassadors representing Z by HP, I assumed a pivotal role in shaping thought leadership and fostering innovation on a global scale. By delivering captivating talks and presentations at prestigious events such as the GTC AI conference and ODSC West, I not only showcased the latest advancements in AI and data science but also facilitated meaningful collaborations and knowledge exchange among peers and industry stakeholders.
Moreover, my passion for nurturing the next generation of data scientists extended beyond professional boundaries. I took pride in initiating various community events and mentorship programs, driven by a heartfelt commitment to empowering aspiring talent worldwide. Within the dynamic Kaggle community, I championed initiatives that offered students invaluable opportunities to learn directly from industry veterans. These efforts weren't just about imparting technical skills; they were about fostering a supportive environment where individuals could thrive, grow, and ultimately contribute meaningfully to the ever-expanding field of data science. My active involvement in mentoring and guiding budding data scientists not only enriched their learning experiences but also left a lasting impact on me, reinforcing the importance of mentorship and collaboration in shaping future leaders in the industry.

Data ScienceCollaborationPublic SpeakingMentorshipLeadership

Onelxp by illumnus®

Data Science Engineer

Jun 2020 – Sep 2020 · 3 mos

Working as one of the core members of the founding team of AI Research and Development at Illumnus. Developing a smart Ai powered ecosystems by leveraging the power of Machine Learning and Deep Learning in our products focusing on collaborative based learning integrating institutions across various nations.

Machine LearningDeep LearningNatural Language ProcessingData Science

Medium

Technical Writer

Apr 2020 – Apr 2021 · 1 yr · India · Remote

Author at TDS, writing articles on Statistics, Machine Learning, and Data Science with around 10 -15k montly active readers. Link: https://blurred-machine.medium.com

StatisticsMachine LearningCreative WritingData Science

Onelxp by illumnus®

Android Developer Intern - AI

May 2019 – Nov 2019 · 6 mos · Jaipur Area, India

Illumnus is building a SaaS-based cloud ecosystem for educational institutes on the Android platform for Real-time Collaboration. Implemented Search Engine, Optimized Push Notifications using GCM & Firebase, Assignment Creating, Submission & Grading, Optical Bar code Detector, Performance Data Analysis & Administration Visualizations. Built a Recommendation System for teachers to get content sharing recommendations for students based on their teaching pattern.

Android DevelopmentData AnalysisData Science

Magicbricks

Android Developer Intern

May 2018 – Jul 2018 · 2 mos · Noida Area, India · On-site

At Magicbricks I have worked as an Android Developer Intern mentored by Mr. Dhiraj Kumar. Contributed to the Text Recognition Classified in Magicbricks Android Application. Major work fields include Optical Character Recognition API, Location Tracking, Map Clustering, AutoComplete API, Canvas Animations, Battery Usage Optimization, Android Permissions library, Deep Link and Firebase.

Android DevelopmentOptical Character Recognition