CV

Summary

10+ years building cloud-native systems across genomics, healthcare, and AI. 1 first-author and 6 co-authored peer-reviewed publications with 125+ citations.

Work Experience

  • Senior Backend & Cloud Engineer
    Feb 2024 - Present
    Egen.ai
    Applied ML Systems. Technology services through cloud, data, AI, and platforms.
    • Built an automated data pipeline migration tool with an internal developer API for healthcare and precision medicine workloads on GCP.
    • Built a Twilio-based testing framework to automatically generate all mood and user behavior call combinations for a Voice AI system, replacing manual script configuration.
    • Hardened CI/CD pipelines, code quality gates, and security standards for a US financial client — catching production issues earlier and reducing error-prone contributions across the team.
    • Developed GCP Agents and A2A orchestration via MCP/ADK for a pharma & biotech corporation.
  • Senior Software Engineer
    Mar 2022 - Feb 2024
    Illumina, Inc.
    Genetic analysis technologies: personalized medicine, disease research, drug development.
    • Evaluated privacy-preserving data sharing solutions to reduce friction from restrictive genomic data policies.
    • Built machine learning pipelines on Illumina Connected Analytics (ICA), HPC, and AWS.
    • Designed and optimized Snowflake-based data models for genomics and GWAS pipelines.
    • Co-authored a pending patent in applied bioinformatics and genomic data processing.
  • Bioinformatics Engineer
    Oct 2018 - Feb 2022
    Cancer Research Center of Toulouse (CRCT)
    Translational cancer research center in Toulouse, focused on oncology and systems biology.
    • Designed reproducible ML pipelines for multi-omics analysis of pancreatic and lung cancer, scaled across cancer types.
    • 1 first-author and 6 co-authored peer-reviewed publications with 125+ citations in oncology and systems biology.
    • Built GARDEN-NET, a web tool for 3D chromatin interaction analysis, still in active use — published in Nucleic Acids Research.
  • Research Engineer
    Jul 2017 - May 2018
    Spanish National Supercomputing Center (BSC)
    European HPC center hosting the MareNostrum supercomputer.
    • Developed a platform for AQuAS to analyze health outcomes and quality indicators through an interactive geospatial map.
    • Automated server setup testing and reproducibility for the Elixir/OpenEBench European HPC infrastructure.
  • Research Technician
    Nov 2016 - Jun 2017
    Spanish National Cancer Research Center (CNIO)
    Spain's national cancer research institute, focused on molecular oncology.
    • Developed a reproducible workflow to identify mutation clusters in protein structures, including web-based visualization.
    • Contributed to the European OpenMinTeD NLP/text-mining project, developing services and APIs for biomedical text analysis.

Education

  • Master in Bioinformatics and Computational Biology
    May 2016
    Instituto de Salud Carlos III
    Master's Thesis: Structure-PPi v2.0: Module for Annotating Cancer-Related Single-Nucleotide Variants at Protein-Protein Interfaces.
  • Computer Engineering (Information Technologies)
    Sep 2015
    Universidad Complutense de Madrid
    Bachelor's Thesis: Computer Vision Adapted to Optical Coherence Tomographies.

Expertise

Cloud & Platforms

  • Terraform
  • GCP
  • AWS
  • Kubernetes / Helm
  • Docker, Podman
  • CI/CD (Concourse, Jenkins, GitHub Actions)

Data Analysis & ML

  • Python (Pandas, Polars, Scikit-Learn)
  • R (Tidyverse, Tidymodels, DESeq2)
  • SQL (Snowflake, BigQuery, PostgreSQL)
  • Generative AI / LLMs
  • Visualization (Seaborn, ggplot2, Dash, D3.js)

Bioinformatics

  • Workflow managers (Nextflow, Snakemake)
  • GWAS (Hail, Regenie, Plink)
  • Genome Databases (Ensembl, UCSC, GENCODE, TCGA)
  • Pathways (Reactome, KEGG, GSEA)
  • Structural Bioinformatics
  • Somatic Mutation Analysis (COSMIC)

Languages

  • Spanish (Native)
  • English (Professional Working Proficiency)
  • French (Professional Working Proficiency)

Portfolio

Publications

Teaching

  • Instituto de Salud Carlos III (ISCIII)
    Role: Graduate Course

    Designed and delivered an intensive 15-hour module on Text Mining and Natural Language Processing (NLP) within the Master in Bioinformatics and Computational Biology program.