M

Matías Battocchia

About

Detail

Mendoza, Mendoza Province, Argentina

Contact Matías regarding: 
work
Full-time jobs

Timeline


work
Job
school
Education

Résumé


Jobs verified_user 0% verified
  • C
    Gen AI Engineer
    Cdigo Brazuca
    Mar 2024 - Jun 2024 (4 months)
    • Created a ChatGPT assistant that generated Solana program interfaces based on smart contract descriptions, utilizing function calls to lint and refine the interfaces. • Implemented prompt test-driven development using promptfoo. • Engineered prompts to generate program interfaces directly from Solana Rust source code.
  • S
    Gen AI Engineer
    Soko Solutions
    Sep 2023 - Dec 2023 (4 months)
    • Development of a ChatGPT based chatbot using Firebase, Node.js, React.js, TypeScript and Tailwind. Heavy use of ChatGPT function calls. Automated testing and CI/CD workflows. • Retrieval augmented generation (RAG) using Sentence Transformers for embeddings and Chroma vector database.
  • ConverseNow.ai
    Machine Learning Engineer
    ConverseNow.ai
    Sep 2021 - Jul 2023 (1 year 11 months)
    • NLU model development for voice bot intent classification: Extracted data from BigQuery. Prepared data using Pandas. Analyzed data for bot intent design. Assisted dataset annotation with ChatGPT. Trained and tested models using HuggingFace. Did ML tracking, conducted experiments and compared transformer-based models. Exported models to ONNX format. Utilized DVC for model versioning. • Implemented an end-to-end pipeline with DVC and Papermill for the NLU model, from dataset generation to model deployment. Established a feedback loop using Jira API, getting annotations from reported issues, checking for solved issues and updating those automatically, sending reports to Slack. This automation reduced the manual work of two–three days to ze
  • V
    Data Engineer
    Vindow
    Aug 2020 - Sep 2021 (1 year 2 months)
    • Designed and developed a serverless data acquisition and model training pipeline on AWS using Step Functions, Lambda, ECS, S3, MongoDB, CDK — I did it almost entirely in TypeScript. It replaced a single process that used to run on AWS EKS and took ~10 hours to complete. Due costs involved the company updated its data/model monthly; with the new pipeline this was done almost on a daily basis, as it just needed around half an hour to finish. • Developed several data services and exposed them in a Node.js web application. One of them was a document recognizer based on AWS Textract. Another was the retrieval side of a recommendation system in Elasticsearch.
  • E
    Lead Data Engineer
    Electriq Power
    Jul 2020 - Dec 2022 (2 years 6 months)
    • Management and mentorship of the data team. I created a team of 3 junior, 1 semi-senior devs from the ground up. • Designed and leaded the development of a real-time IoT data ingest pipeline, data lake and APIs in GCP using Pub/Sub, Cloud Run, Cloud Functions, Dataflow, InfluxDB, FastAPI. This pipeline replaced a legacy process that run in a single VM. Going serverless provided redundancy and robustness to the pipeline. In addition, the improved data schema and APIs reduced some queries from 60 seconds to 2 seconds.
  • Mutt Data
    Machine Learning Engineer
    Mutt Data
    Jul 2019 - Aug 2020 (1 year 2 months)
    • End-to-end development of data products. Time series featurization and forecasting using several estimators such as XGBoost and FB Prophet.
  • P
    Machine Learning Engineer
    Properati (OLX)
    Oct 2017 - May 2019 (1 year 8 months)
    • Set up a data warehouse on BigQuery, using Airflow for ETL processes. • Created models using scikit-learn and deployed them as REST APIs using Flask and Docker.
  • s
    Contributor
    socios.red,
    data journalism platform ingests and blends data from several sources. ArangoDB, database. Kubernetes (Azure AKS).
Education verified_user 0% verified
  • Universidad de Buenos Aires
    Licentiate
    Universidad de Buenos Aires
    Jan 2009 - Jan 2018 (9 years 1 month)
This is a community-created genome.