Marius Vileiniškis, PhD

AI Researcher

Vilnius, Vilniaus, Lithuania15 yrs 10 mos experience
Most Likely To SwitchAI Enabled

Key Highlights

  • Over a decade of experience in data science and machine learning.
  • Expert in building scalable data pipelines and predictive models.
  • Passionate mentor guiding the next generation of data scientists.
Stackforce AI infers this person is a Data Scientist with expertise in Machine Learning and Data Engineering across various industries.

Contact

Skills

Core Skills

Machine LearningData EngineeringArtificial Intelligence (ai)MentorshipData Management

Other Skills

DatabasesRetrieval-Augmented Generation (RAG)AirflowLangChainLarge Language Models (LLM)PythonSQLMentoringData ScienceCareer CounselingMatlabC++StatisticsSASProgramming

About

I ❤️ 🐍 & open source technologies Based in Lithuania, but working in the United States of Data Data Scientist | Machine Learning Engineer | AI Engineer Passionate about driving data-driven innovation and delivering solutions that make an impact. With over a decade of hands-on experience, I've honed my skills in data engineering, machine learning, and MLOps. My expertise spans from designing and building data pipelines that scale to developing and deploying predictive models that deliver measurable business value. Key strengths: Machine Learning: Applying advanced algorithms (e.g., scikit-learn, PySpark ML) to solve complex problems and extract valuable insights. Data Engineering: Building robust data pipelines using tools like Airflow, PySpark, and Apache Beam to ingest, transform, and store data from diverse sources. MLOps: Ensuring model reliability and efficiency through continuous monitoring, retraining, and deployment using platforms like SageMaker, IBM Watson, and GCP AI Platform. Cloud Platforms: Proficient in GCP and IBM Cloud, with experience in AWS and Azure. Mentorship: Inspiring and guiding aspiring data scientists to develop their skills and contribute to meaningful projects. Technologies / Skills: Python, Airflow, SQL, Docker, GitOps, LangChain Industries: banking, retail, IT, technology consulting

Experience

15 yrs 10 mos
Total Experience
2 yrs 5 mos
Average Tenure
5 yrs 2 mos
Current Experience

Women go tech

Career Mentor

Mar 2023Oct 2023 · 7 mos

  • I have mentored 2 women that wants to change their career paths and break into the data field. During the mentoring process we went through what's relevant in the starting point and how to build a portfolio of projects that would prove the skills necessary for data analyst or data scientist are already there. The mentees started working on one selected personal project which helped making the time spent together more productive.
Data ScienceMentoringCareer CounselingMentorship

Majid al futtaim

Staff Data Scientist

Oct 2021Present · 4 yrs 7 mos · Remote

  • Built a solution to optimize back-of-house workforce in Carrefour hypermarkets in UAE
  • Overhaul of a solution to optimize checkout counters workforce in Carrefour stores across 5 markets
  • Upgraded 2 year old Airflow (1.10.11 -> 2.7.0) and Superset (0.36 -> 3.0.1) instances
  • hosted on EC2. Afterwards, I worked with the DevOps team to migrate them to EKS
  • ensuring seamless user transition.
  • Developed a chat bot / AI assistant for personalized recommendations using Chainlit, Langchain,
  • Azure Open AI services by employing RAG based methodology.
  • Developed an internal AI Academy course to equip colleagues with the skills to build AI assistants with RAG based methodology
  • Developed backend for personalisation on the SHARE Rewards app (1 million+ downloads), which serves dynamic personalised content based on users transactions, followed brands, interests and in-app activity
DatabasesRetrieval-Augmented Generation (RAG)AirflowLangChainLarge Language Models (LLM)Python+4

Kaunas university of technology

Student Mentor

Mar 2021Present · 5 yrs 2 mos · Kaunas, Kauno, Lithuania

  • I'm a mentor for students interested in data analytics related studies in SKILLed FinTech programme. Currently mentoring 3 students, which are in different phases of their university journey. Giving out advice on career choices, job search, sharing my technical experience
  • https://students.ktu.edu/people/marius-vileiniskis/
  • https://studentams.ktu.edu/people/marius-vileiniskis/
MentoringMentorship

Mambu

Senior Data Scientist

Oct 2020Oct 2021 · 1 yr · Vilnius, Vilniaus, Lithuania · Hybrid

  • Enabling sales organisation to understand infrastructure costs to run Mambu SaaS cloud banking platform in a cloud agnostic setup (GCP, AWS, Azure, Oracle) to maintain the desired sales margin when individually pricing offers to the clients
  • Established team workflows using GCP services including Dataflow, BigQuery, Cloud Composer and Vertex AI to collect, analyse, optimise and predict cloud cost data in cloud agnostic setup
DatabasesAirflowPythonSQLMachine LearningData Engineering

Ibm

Senior Data Scientist - Machine Learning Engineer @ Data Science and AI Elite team

Jan 2019Oct 2020 · 1 yr 9 mos · London, United Kingdom

  • Client facing role with a big technical emphasis
  • Predicting anomalies in offshore electric cables:
  • Prevent outages of offshore windfarms by enabling the engineers to perform predictive maintenance on electric cables buried on the seabed by detecting the degrading state from acoustic and temperature readings
  • Predicting the no-show passengers:
  • Enabling the team to perform Big Data analysis using Watson Studio and Spark to handle large data processing pipelines. The challenge was to move from in-memory computing with Python to a highly parallelised solution with Spark
  • Detecting fraudulent credit card transactions:
  • Helping the client to prevent both financial and credibility losses due to the fraudulent transactions being validated as non-suspicious and thus authorised. In order to achieve this a machine learning model to detect
  • whether a transaction is fraudulent or not for card not present transactions was built
  • Data Science Platform on IBM Cloud:
  • Developing a data science platform based on IBM Cloud services including Watson Studio, Watson Machine Learning, Analytic Engine, Kubernetes and OpenScale. Establishing the best practices of delivering data science projects using the built platform by clearly separating the roles and responsibilities of data science, developer and operations teams
  • AI fairness and explainability capability demonstration of Watson OpenScale:
  • An extensive PoC to demonstrate existing capabilities in Watson OpenScale for explainability, bias detection, data drift and other topics on a variety of common use cases in financial sector
  • Anomaly detection in VoIP testing framework:
  • Design and implement an anomaly detection process with most likely root causes for VoIP call tests based on the packet captures (PCAP) files. Building a dashboard with Dash in order to allow test automation engineers to analyse historical root causes, debug each test ran on a system on an application layer (SIP messages) with helpful visuals and proposed root causes
DatabasesMentoringPythonSQLMachine LearningData Engineering

Hometogo

Senior Data Scientist

Oct 2018Dec 2018 · 2 mos · Vilnius

DatabasesPythonSQLMachine Learning

Danske bank lithuania

Senior Data Scientist

Sep 2016Oct 2018 · 2 yrs 1 mo · Vilnius, Vilniaus, Lithuania

  • Projects:
  • Fraud detection for international payments:
  • Developing machine learning models to extend the rule based fraud detection solution
  • AML adverse media screening tool:
  • NLP + machine learning to screen new bank customers for adverse media on money laundering. Project has been a runner up for most innovative automation project in Shared Services & Outsourcing Excellence Awards
  • Collateral management email automation:
  • NLP + machine learning to automate collateral processing
  • Risk score modelling experimentation system on PySpark + Django
  • A tool that generates PySpark code from an exported FICO software rule model to expedite the simulation time of tuning rule thresholds up to 20x
DatabasesPythonSQLMachine Learning

University of nottingham

2 roles

Research Associate in System Reliability Modelling

Promoted

Oct 2014Aug 2016 · 1 yr 10 mos

  • Industrial project: A People Centred Approach to Intelligent, Proactive, Predictive (PCIPP) asset management using Remote Condition Monitoring (RCM) data
DatabasesPythonSQLMachine Learning

PHD Researcher

Oct 2011Dec 2015 · 4 yrs 2 mos

  • PhD project in Fault diagnostics for engineering systems
  • The main aim of this thesis is to investigate available techniques and develop a methodology for the fault detection and diagnostics for two engineering systems, namely railway point systems and three-phase separators. The two systems represent two different situations occurring when dealing with fault detection and diagnostics. The first system has only one observable sensor which represents the operation of the system, while for the second system several sensors are available to monitor the operation of the system.
  • The fault detection of the railway point systems (RPS) was performed on the measured current from the motor of point operating equipment (POE). A threshold based alarm technique is commonly used by railway infrastructure operators. The inability to timely detect the incipient faults of the RPS is a big deficiency of such a technique. The method of One Class Support Vector Machines (OCSVM) together with some elastic metrics has been proposed to overcome this weakness of the threshold based alarm technique.
  • The fault detection of three-phase separators (TPS) on chemical process plant was performed given the sensor readings of flow and level transmitters of TPS. A threshold based alarm technique is commonly used by oil and gas infrastructure operators to monitor the operation of TPS. The late detection of faults of the TPS is a big deficiency of such a technique, since it causes the oil and gas processing plants to be shut down. The method of Bayesian Belief Networks (BBN) has been proposed to overcome this weakness of the threshold based alarm technique.

Biomapas

Data management specialist

Feb 2009Jul 2010 · 1 yr 5 mos

  • Statistical programming with SAS, data management with EpiData and ClinTrial, maintenance of web page content
Machine Learning

Kinder garden gandriukas

Computer technician

Dec 2008Jun 2009 · 6 mos

  • Maintenance of computer network, maintenance of web page
DatabasesData Management

Education

University of Nottingham

PhD — Civil Engineering

Jan 2011Jan 2014

Lund University

Master's degree — Statistics

Jan 2010Jan 2011

Kaunas University of Technology

Master's degree — Applied Mathematics

Jan 2009Jan 2011

Kaunas University of Technology

Bachelor's degree — Applied Mathematics

Jan 2005Jan 2009

VDU Rasos gimnazija

Jan 1998Jan 2005

Stackforce found 100+ more professionals with Machine Learning & Data Engineering

Explore similar profiles based on matching skills and experience