I'm a bioinformatics and health data science researcher with a PhD in Ageing and Chronic Diseases.
My work combines computational biology, machine learning, and reproducible data workflows to study clinically relevant problems in infectious diseases, with a strong focus on HIV-1 and malaria.
- π¬ Researcher at ICVS, School of Medicine, University of Minho
- 𧬠Currently working on AI-based prediction of HIV-1 tropism from routine sequencing data
- π¦ Contributing to Malaria therapeutic failure prediction using machine learning
- π οΈ Building tools for sequence analysis, biomedical knowledge graphs, and data integration
- π Interested in translational bioinformatics, infectious disease genomics, and health data science
- HIV-1 evolution, diversity, and drug resistance
- Machine learning for infectious disease genomics
- Biomedical knowledge graphs and graph-based learning
- Bioinformatics pipelines and reproducible research
- Health data analysis and translational computational biology
A machine learning pipeline for patient-level malaria therapeutic failure prediction.
Python toolkit for local HIV-1 sequence alignment, subtyping, and gene splitting.
Reproducible pipeline for HIV coreceptor tropism prediction, integrating preprocessing, encoding strategies, and machine learning/deep learning models.
Scalable Snakemake pipeline for automated and standardized biomedical knowledge graph construction.
Languages & Data
- Python
- SQL / MySQL
- Git / GitHub
- ETL workflows
- Power BI
Bioinformatics & ML
- Machine Learning
- Biomedical Knowledge Graphs
- Sequence Analysis
- Drug Resistance Analysis
- Phylogeny
- Snakemake
I also enjoy contributing to teaching, mentoring, and science communication, especially in bioinformatics, Python, and digital tools for health research.
- π« Email: anaapspereira@gmail.com
- π ORCID: 0000-0001-6410-7972