Skip to content
View soutogustavo's full-sized avatar

Block or report soutogustavo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please donโ€™t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
soutogustavo/README.md

Hey ๐Ÿ‘‹ I'm Gustavo

Senior Data Scientist based in Berlin with 9+ years building and shipping ML systems that run in production, not just notebooks. I specialize in time-series forecasting, anomaly detection, and end-to-end MLOps applied to industrial IoT and telematics domains. Recently expanded into agentic AI, designing multi-agent pipelines with LangGraph and A2A frameworks.

๐Ÿ“ Berlin, Germany ๐Ÿ“ง Email: ghsouto@gmail.com ๐Ÿ”— LinkedIn: soutogustavods


๐Ÿ”ง Tech Stack

Area Tools
Machine Learning Scikit-Learn, LightGBM, XGBoost, PyTorch, Isolation Forest, HMMs
LLM & Agentic AI LangGraph, LangChain, A2A, Crawl4AI, LangSmith, MCP
MLOps & Production MLflow, Docker, FastAPI, CI/CD (GitHub Actions), Model Monitoring
Data Engineering PySpark, Databricks, Apache Spark
Cloud AWS (SageMaker, S3, Lambda, Athena)
Visualization Streamlit, Plotly/Dash
Languages Python, SQL

๐ŸŽ“ Education

  • PhD Candidate (ABD), Computer Science โ€” TU Dortmund / Fraunhofer ISST. Research on anomaly detection in spatial time-series data. 3+ peer-reviewed publications.
  • M.Sc., Electrical and Computer Engineering โ€” Federal University of Rio Grande do Norte (UFRN)
  • B.Tech., Computer Networking and Telecommunications โ€” Universidade Estรกcio de Sรก (FATERN)

๐Ÿ“ซ Open to

Senior Data Scientist and ML/MLOps Engineer roles (full-time or contract) in industrial, energy, or IoT domains. EU work authorization. Available immediately.

Pinned Loading

  1. cc-fraud-detection cc-fraud-detection Public

    Evaluate a transactional dataset and build a Machine Learning model to detect frauds.

    Jupyter Notebook

  2. mlflow-infra mlflow-infra Public

    MLflow tracking server with PostgreSQL and MinIO, designed for DS/MLE teams

    Python

  3. spark-license-forecasting spark-license-forecasting Public

    Forecasting monthly liquor (new) license issuance in Chicago using PySpark and Databricks.

    Jupyter Notebook

  4. spark-sql-retail-analysis spark-sql-retail-analysis Public

    Exploratory data analysis on the UK Online Retail dataset using Databricks and SQL.

    Jupyter Notebook

  5. rossmann-forecasting-benchmark rossmann-forecasting-benchmark Public

    [WIP] End-to-end retail forecasting pipeline using a hybrid Clustering + Prophet architecture. Features automated model versioning via MLflow and store-level factor tuning to optimize local demand โ€ฆ

    Jupyter Notebook

  6. review-radar review-radar Public

    [WIP] An AI-powered pipeline that scrapes Google Maps reviews, identifies recurring negative themes, and generates specific operational recommendations for small business owners.

    Python