cv

Basics

Name Hector Climente-Gonzalez
Label Applied Machine Learning Researcher
Summary Lead scientist with 12+ years of experience in computational biology & bioML. Leveraging genetics, epigenetics and genomics to inform early target discovery and precision medicine.

Education

Work

  • 2024.11 - Present

    London, UK

    Lead Data Scientist
    Novo Nordisk, Research & Early Development
    Key skills: Human genetics, AI/ML, Real-world data, Strategy development
    • Develop, deploy and mine AI/ML models for SNP interpretation, early target discovery and precision medicine. Influenced decisions on target progression.
  • 2023.03 - 2024.11

    London, UK

    Senior Data Scientist
    Novo Nordisk, Research & Early Development
    Key skills: Human genetics, AI/ML, Pipeline development, Real-world data
    • Developed the 2-year VP area strategy on AI/ML for genetics-based target discovery
    • Developed Nextflow pipelines for large-scale genetics analysis (e.g., colocalization, Mendelian randomization) following best nf-core practices. Massively decreased compute time and human effort for common workflows.
  • 2020.05 - 2023.02

    Kyoto, Japan

    Special Postdoctoral Researcher
    RIKEN,Center for AI Project (AIP)
    Key skills: Deep learning, Nonlinear feature selection, Graph regularization
    • Developed novel deep learning architectures for DNA sequence learning and computer vision
    • Developed graph methods to study RNA-seq gene expression profiles in leukemia
    • Developed strategies for FDR-controlled feature selection
  • 2016.10 - 2020.04

    Paris, France

    Ph.D. student
    Institut Curie & Mines ParisTech, Centre for Computational Biology
    Key skills: Machine learning, Human genetics, Statistical interactions, Kernel methods
    • Developed machine learning methods that leverage graphs to study the genetics of complex diseases
    • Applied methods to discover cancer and autoimmune genetic markers in GWAS (single-SNP and epistasis)
    • Authored three software packages: martini (R, in Bioconductor), and pyHSICLasso and spada (Python, in PyPI)
  • 2014.09 - 2015.05

    Barcelona, Spain

    Head of Biocomputing
    Anaxomics Biotech Ltd., Biocomputing department
    • Developed and maintained pipelines for the statistical analysis of RNA-seq, microarray, WGS and MS-MS data
    • Managed the bioinformatics area including decision-making, intern supervision, and interdepartment coordination
  • 2013.12 - 2016.08

    Barcelona, Spain

    Research assistant
    Pompeu Fabra University, Research programme on biomedical informatics
    • Conducted a large scale study of the involvement of alternative splicing in cancer
  • 2013.06 - 2013.12

    Barcelona, Spain

    Research intern
    Centre for Genomic Regulation
    • Developed a tool to massively analyze DNA-protein interfaces from structures in the Protein Data Bank
    • Co-analyzed RNA-seq and mass spectrometry data from Mycoplasma pneumoniae

Skills

Programming
Python (numpy, pandas, scikit-learn)
Deep learning (PyTorch)
nextflow (nf-core, nf-test)
R
Big Data
Accelerated computing (CUDA, HPC)
Cloud computing (Azure)
Databases (SQL)
Omics
GWAS
Epigenetics (ATAC-seq)
RNA-seq (single cell and bulk)
MS-MS proteomics
DevOps
Virtual environments (Docker, conda)
MLOps (Azure ML, MLFlow)
Testing
CI/CD
Open source
git
Professional Skills
Communication: Lead writer of articles, project proposals & documentation; engaging speaker at global conferences & senior leadership briefings; proactive at seeking and disseminating best practices.
Project management: Management of projects within and between teams, and across companies; experience with Scrum, including sprint planning, stand-ups and retrospectives.
Interpersonal: Active listener, interested in knowledge sharing; onboarding and mentoring of junior colleagues.

Certificates

Awards