Santosh - Data Scientist, Machine Learning Engineer
Ref : 200831M002-
94200 PARIS
-
Data Scientist, Data Analyst (40 ans)
-
Totalement mobile
-
En profession libérale
Work Experience
Danone France
LEAD DATA SCIENTIST OF GLOBAL TEAM July 2022 - May 2023
• Led the development and deployment of machine learning and statistical models into production for several country’s markets based on
business requirements using Python, Spark, PySpark, Databricks, Azure data lake, Azure data factory, Snowflake, MLflow, Terraform, DevOps
tools (CI/CD, Kubernetes, Azure Artifactory/Package Management) and JIRA.
• Led data science and data engineering enhancement and support to existing machine learning products in DACH (Germany, Austria, and
Switzerland), and Benelux market of Danone covering multiple use cases on propensity modeling, community clustering, Lead scoring etc.
• Led the development and deployment of various machine learning models, AI, and statistical models into production for South East Asia market
of Danone covering use cases such as churn, migration, OCR, & consumption.
• Managed the several tasks involving data engineering, BI, GDPR, and compliance such as egression of model results into Power BI systems,
Snowflake architecture and schema creations, working on Power BI dashboard with DACH country markets, obfuscation framework for data
handling regarding GDPR, developing data ingestion and egression framework w.r.t. GDPR and Compliance.
• Managed stakeholder relationship with countries’ business units in Europe and in Asia on weekly basis on product support, enhancement and
bug fix for data science and data engineering team.
Johnson & Johnson France
EMEA ANALYTICS MANAGER - CUSTOMER ENGAGEMENT June 2021 - May 2022
• I workedwith Engagement Lead of Europe, and Country Platform Lead of Belgium, Netherlands, and Spain to launch Artificial Intelligence driven
Omnichannel engine that generates next best action plan for sales force with content recommendation for several prescribed drug brands of
multiple therapeutic areas.
• My responsibilities was split into 50% technical development using python, azure, databricks, & spark and 50% was on leading and mentoring 15
person data science and data engineering team on various projects including main rollouts, and other core technical projects such as ConfigLab,
quarterly AI omnichannel engine enhancements.
• Led the developments of machine learning models into production setup with data science and data engineering team aligning go-to-market
strategy of country’s market.
• Used Python, SQL, & Azure daily to generate machine learning driven business insights to drive go-to-market strategy and change management
in country’s markets on a monthly basis with country platform lead of Belgium and Netherlands.
Sentium AI Paris, France
FOUNDER Jan. 2020 - Current
• I am a founder of Sentium AI (********), which is incubated at Station F in Paris.
• At Sentium AI, I am developing human-augmented design assistant to improve usability of mobile apps using cutting edge computer vision
and deep learning.
• We are developing MVP using python ecosystem including Tensorflow, openCV, keras, numpy, matplotlib etc.
Prisma Media Paris, France
CONSULTANT - DATA SCIENCE & AI Aug. 2019 - Jan 2020
• I was a tech lead responsible for implementing data science and data engineering using Python for a development of health and nutritional
support mobile app.
• Developed rule based Natural Language Processing algorithm in python to curate text data of food items and nutrients.
• Developed adaptive datamining algorithms in python using rest APIs, beautiful soup, and pandas to parseweb datawhile considering semantic.
• Built data pipeline and data architecture with SQL backend.
Authentify AI Paris, France
CO-FOUNDER Oct. 2018 - Dec. 2019
• Authentify AI is developing a Natural Language Processing technology that helps legal and compliance professionals in financial institutions to
be compliant underlying regulatory change using state of the art technology such as tensorflow and RNN.
• The startup was conceptualized during the Entrepreneur First deep tech incubation program in Paris.
• I conducted customer development to understand the business need and customer requirement about regulatory change management and
financial compliance.
• Developed semantic parsing in python for financial regulations texts.
Entrepreneur First Paris, France
ENTREPRENEUR Sep. 2018 - Jan. 2019
• Cohort member of incubation program of Entrepreneur First to found a deep technology startup.
• During the program I worked on machine learning driven startup ideas.
• Technological product development, generating new customer leads and traction, and business model development.
Boston Scientific Paris, France
MACHINE LEARNING SCIENTIST Jan. 2017 - Dec. 2018
• Received annual innovation award from Boston Scientific for my work on machine learning and NLP for identification of clinical thought leaders.
• Text analytics and NLP using python and R for a large scale web data including medical research and social media data.
• Developed various machine learning methodologies for analyzing large scale social media and medical research data.
• Conducted data analysis such as Clustering, Sentiment Analysis, Fuzzy Matching, Topic Modeling, Quantitative Discourse Analysis etc. on a
large scale web data to identify key opinion leaders in medical research.
• Development of big data infrastructure such as building data pipeline, API endpoints, sql backend architecture to analyze medical research
data.
• Developed data driven, and data visualization web app.
• Enhancing scalability and performance of web application.
L’Institut national de recherche en informatique et en automatique Paris, France
VISITING RESEARCHER Sep. 2016 - Dec. 2016
• I worked on Ontology and RDF Semantics.
• I worked on consolidating Java library for semantic analysis.
Cambridge University Cambridge, UK
RESEARCH ENGINEER May. 2015 - Feb. 2016
• Semantic learning and analysis algorithm in Python for data validation.
• Developed learning algorithm using Python to structure an unstructured and semi-structured data format such as XML, pdf, and html documents.
• Developed unsupervised machine learning algorithms using python (scikit-learn, pandas, matplotlib etc) to identify and predict patterns in
public procurement contract data.
Independent Paris, France
DATA SCIENCE FREELANCING Mar. 2016 - Current
• Provided consulting services for short duration projects to many industries on various data science and machine learning projects using Python,
R, SQL, NoSQL, Neo4j, and Linux.
• Provided consultancy on topics such as Bayesian statistics, supervised and unsupervised machine learning.
• Consulted on data analytic projects such as institutional loan data management, sale forecasting, manufacturing process optimization, market
survey & mapping, data engineering and data science for healthcare.
Arizona Department of Education Phoenix, USA
LEAD DATA ARCHITECT May. 2014 - Apr. 2015
• Led database and application development team to build a highly efficient and scalable relational (sql) database system.
• Gathered requirements from stakeholders to develop a database system.
• I worked on database designing, logical data modelling, data documentation, data mapping, data migration, and data integration for building
a new data warehouse system.
• Data mining and statistical analysis such as Regression analysis, decision tree, and clustering in Python, and SAS for learning analytics.
CVS Health Scottsdale, USA
DATA SCIENTIST Sep. 2013 - Apr. 2014
• Managed Terabytes of healthcare data in Teradata and Oracle to analyze health and medicare plan for over 65 million members.
• Conducted data analysis using techniques like a decision tree, support vector machine, regression modeling, and ANOVA in SAS to help CVS
Health clients to make key decisions regarding health and medicare plan membership of their employees.
• Modeling of time series data in pharmacy benefit management and CVS customer care management.
U.S. Department of Transportation Tucson, USA
GRADUATE RESEARCH FELLOW Jun. 2012 - Jul. 2013
• Developed and implemented AI based Agent-based simulation modeling system to model hazmat drivers route choice.
• Managed stakeholders involved including US. DOT and University of Arizona
Boeing Tucson, USA
GRADUATE RESEARCH FELLOW Jul. 2011 - May. 2012
• Developed decision making support system using machine learning optimization algorithms for the analysis of aircraft design change propagation.
• Conducted research works in simulation with application to aircraft design change propagation.
University of Arizona Tucson, USA
GRADUATE RESEARCH ASSISTANT Aug. 2011 - Jul. 2013
• Developed Bayesian Belief Network, Reinforcement learning, and depth first search to model a probabilistic behavior of driver’s route choice.
• Developed Mix-integer programming mathematical models and a novel Optimization Algorithms for route optimization considering hazardous
material transportation risk, toll pricing and traffic congestion.
• Worked on developing optimization algorithms, statistical modeling such as ANOVA, design of experiments, and regression analysis to analyze
product design changes.
• Led the project team to develop an automated shop floor control system using tools such as .NET, VB, MS Access, Web Service, SOAP, and Arena.
University of Arizona Tucson, USA
DOCTORAL RESEARCHER Aug. 2011 - Jul. 2013
• Conducted publishable research works in topics of AI, optimization, and simulation modeling.
• Published 7 papers in international journals, and conferences
• Completed many doctoral courses and doctoral seminars.
• Attended and presented in several international conferences.
INRIA Metz, France
VISITING RESEARCHER Apr. 2011 - Jul. 2011
• Developed several optimization algorithms to solve computationally expensive combinatorial optimization problems.
• Worked on statistical analysis to analyze a performance of several optimization algorithms.
Procter & Gamble/IIT Kharagpur Mumbai/Kharagpur, India
PROJECT LEAD - DATA ANALYTICS Jun. 2010 - Mar. 2011
• This was a joint project between P& G and IIT Kharagpur.
• I led the project to develop a data driven analytical solution to enhance and optimize logistics operation of P&G India.
• Headed a project to develop a data-driven application for optimizing and enhancing logistics operation that saved 16% annual logistic cost in
India.
• Analyzed Terabytes of logistics data to devise logistics strategy for vehicle selection, truck loading and capacity optimization, and vehicle routing.
IIT Kharagpur Kharagpur, India
MACHINE LEARNING RESEARCHER Jun. 2008 - May. 2010
• Developed novel machine learning, data mining, and time series forecasting algorithms with applications to transportation, manufacturing,
and finance.
• Developed a time series forecasting algorithm involving temporal association rules, associated clustering, and ARIMA for stock prediction and
flood forecasting.
• Developed unsupervised machine learning algorithms with application to industrial project scheduling.
• Conducted research on optimizing outbound logistics network design for a manufacturing supply chain using optimization heuristics.
Awards & Achievement
2017 ImagineIF Innovation Award, Boston Scientific Massachusetts, USA
2011 Postgraduate Research Fellowship, University of Arizona Tucson, USA
2010 Best Master Dissertation Prize, Indian Institute of Technology (IIT) Kharagpur Kharagpur, India
2008 Full Graduate Scholarship, Indian Institute of Technology (IIT) Kharagpur Kharagpur, India
2008 Ministry of Human Resource Development Fellowship, Ministry of Human Resource Development India
2003 University Undergraduate Fellowship, Amravati University Amravati, India
Technical Skills
Programming Python (Pandas, NumPy, SciPy, Matplotlib, BeautifulSoup, SciKit-Learn, OpenCV, Keras, Tensorflow), R, SQL, SAS, Matlab, C, Java,
Spark, Scala, Linux Bash, LaTeX, Visual Basic
Back-end All SQL databases, NoSql, Graph Database Neo4j, Hadoop, MapReduce, Hive, Pig, AWS, GoogleCloud, REST API
Front-end HTML, CSS, JavaScript
Other tools Git, Power BI, Tableau, Arena, AnyLogic, Lingo, CPLEX, GAMS, SPSS, STATA
Languages English, French(Intermediate), Hindi, Marathi
Education
University of Arizona Tucson, USA
PHD CANDIDATE IN SYSTEMS ENGINEERING Jul. 2011 - Aug. 2013
• Artificial Intelligence, Machine Learning, and Optimization.
University of Arizona Tucson, USA
MASTER OF SCIENCE IN SYSTEMS ENGINEERING Jul. 2011 - May. 2013
• Algorithms, Engineering Statistics, Optimization, Mathematical Modeling, Financial Modeling, and Systems Engineering.
Indian Institute of Technology (IIT) Kharagpur Kharagpur, India
MASTER OF TECHNOLOGY Jul. 2008 - May. 2010
• Quantitative Methods, Data Analysis, Simulation, Operational Research, Engineering Management.
Amravati University Amravati, India
BACHELOR IN ENGINEERING MECHANICS AND MATHEMATICS Jul. 2003 - May. 2007
• Mathematics, Physics, Engineering Mechanics, Structural Analysis.