Anamika Sinha — AI Researcher
Data scientist with hands on knowledge of machine learning , natural language processing using neural networks, big data storage as well as experiment design. With a Masters in Data Science from Berkeley combined with experience in data analysis, business systems analysis and programming in transactional as well as data warehousing analytics, I bring a rich skill set to gain deep data insights rooted in statistical concepts. Languages: R, Python, SQL Competencies: Machine learning, cloud computing (AWS, Google Cloud), advanced statistics, SPARK, Git/Github, LINUX command line, natural language processing, Tableau, Scikit learn libraries Databases: Oracle, Hive, Postgres, HDFS, DB2, graph database Neo4j DeepLearning: Feed Forward Neural Network, Convolutional Neural Network(CNN), Recurrent Neural Network (RNN) Tools: Tableau, Git, , Hadoop, Hadoop Streaming, SPARK distributed computing, TensorFlow, Keras Select Data Science Projects (For more projects and code, please refer to github): Data engineering project on quality of care for Medicare patients - Built a pipeline to access publicly available data bout hospital performance, loaded into a HDFS data lake, created ER diagram and transformed data to use SQLs to answer key questions. Image recognition (deep learning) Kaggle project-Used TensorFlow to build a convolutional neural network for detecting facial keypoints to understand key principles of deep learning. Transfer learning with sentiment analysis(NLP) - Applied natural language processing techniques to understand model transferability from a source domain in order to optimize performance when a small amount of labeled data is available in the target domain (used Amazon review dataset). Adverse drug reactions chatbot (adriabot.com)- Facilitating easy Information retrieval about adverse drug reactions from the biomedical literature and presenting it to a patient in a format that is easy to use and easy to interpret.
Stackforce AI infers this person is a Data Science expert in Healthcare and SaaS industries.
Location: San Mateo, California, United States
Experience: 17 yrs 3 mos
Skills
- Data Engineering
- Machine Learning
- Data Analysis
- Natural Language Processing
- Data Science
- Business Analysis
- Software Development
Career Highlights
- Reduced troubleshooting effort by 50% in data quality.
- Generated $4.2 million savings through patient ranking system.
- Improved model prediction quality by 12% in production.
Work Experience
Tendo
Principle Data Scientist Consultant (11 mos)
agilon health
Lead Data Scientist (1 yr 2 mos)
Senior Data Scientist & Scrum Team Lead (2 yrs 2 mos)
SugarCRM (Through Node.io acquistion)
Senior Data Scientist (1 yr 2 mos)
Node.io
Data Scientist (1 yr 1 mo)
UC Berkeley School of Information
Graduate Student of Data Science (2 yrs)
Silicon Tech Lab/ UC Berkeley
Analytics Lead (3 yrs 9 mos)
Infosys
Software Programmer (5 yrs)
Education
Master's degree at UC Berkeley School of Information
Bachelor’s Degree at BIT Sindri