Develop Machine Learning models using Python, Pandas, Scikit-Learn and other related technologies over Hadoop clusters using Apache Impala.
Analyze data and use clustering and correlation techniques to obtain business insights.
In company trainings related to Data Science and Machine Learning.
Data Scientist
Jampp
Feb 2016 - Aug 2018(2 years 7 months)
Designed, developed and pushed into production and mantained a data product that segments users according to their likelihood of conversion and integrates with the company’s RTB bidder’s strategy.
The system efficiently queries PrestoDB (a distributed SQL layer over Hadoop) to obtain the users
relevant information and MySQL for campaign configurations. Internally the app uses MySQL, Redis
and Amazon S3 to store data, employs scikit-learn for classification and feature engineering, and
trains a random forest model to segment users.
Created a data app for scoring with a quality index different traffic sources (according to the level of activity of the users they bring).
Developed a decision tree classifier that optimizes the configuration of
External Consultant
Piaggio Group
Mar 2014 - Oct 2015(1 year 8 months)
Business plan design and implementation of commercial vehicle expansion project to LATAM.
Preparation of quarterly automobile macro sector reports (time series forecasting in Python with
Statsmodels).