Eliseo Papa
Senior data scientist and clinician working on AI applied to Drug Discovery. Background in biomedical engineering and bioinformatics.
Experienced in
Machine learning, python and data engineering architecture - delivering production-grade machine learning pipelines to computational biology, statistical genetics and early discovery groups.
Leading academic/industrial research collaborations, managing multi-year scientific projects, writing funding grants, leading product decisions in a software team.
Knowledge graph learning from text
Analysis of clinical trial data - current and retrospective analysis up to Phase 2b
Drug target identification and prioritization - Genetics, integrative and network-based approaches
Sequencing analysis - QC pipelines and statistical analysis of 16S and RNA-seq experiments
Machine learning for biomarker identification - human microbiome and immune cells subpopulations
Medicine - anesthesia, genetics, general medicine
Professional Experience
2020 - now
Director AI Engineering, AstraZeneca
- scale up effective engineering team practices to a > 50 organization
- foster customer and value oriented culture
- scale up interviewing and culture-sharing
- establish academic collaborations and set a publication strategy
- stabilizing existing innovation team and bring them to the next phase of maturity
2018 - 2020
AI & Data Science Lead, AstraZeneca
- Fast-paced hiring to kickstart and scale an AI team within R&D IT.
- Leading separate teams dedicated to:
- using BERT language-based models in NLP to extract relationship from biomedical literature
- build a large-scale knowledge graph with internal data
- leverage graph ML for target identification and repurposing
- support drug discovery decision with recommender systems.
- rewrote the interview process,
- established new ways of working
- directly contributed code to new projects.
Feb2017 - 2018
Senior Data Scientist, Translational Advanced Analytics, Biogen
Feb2016 - Feb2017
Manager, Digital Health Technology & Data Science, Biogen
Projects:
1) Advanced analytics & scientific computing
I employ my knowledge of machine learning, python and data engineering architecture to deliver production-grade analysis pipeline to Biogen’s computational biology group, statistical genetics group and early discovery groups. This includes models based on RF, SVM, xgboost (scikit-learn), vowpal wabbit applied on biomarker and genomics data, as well as applied deep learning approaches (Keras, Tensorflow) applied on MEDLINE data (22m records) plus clinical trial literature and data. Development follows an iterative and agile process, done in collaboration with scientist at all levels of the organization, and which requires a deep understanding of the domain as well as the technologies available. My broad background (medicine, ML, biomedical engineering) helps me to bridge these diverse skill sets and communicate across disciplines.
2) Open Targets liaison
Working on computational biology and genomics analysis which leverage large scale human data sets with the goal of identify and prioritize new drug targets. Direct interactions with Open Targets members (the Wellcome Trust Sanger Institute, the European Bioinformatics Institute, and GSK) and Biogen scientists. Manage and influence the scientific program working to enhance its quality and matching the Open Targets capabilities to Biogen needs. This work has resulted in several collaborations approved and funded by Open Targets that benefit Biogen. I routinely represent Biogen externally, including speaking invitations at informatics and scientific venues.
3) Development of the Open Targets Platform
I am one of the team lead, responsible to plan and manage the work of 2 Biogen and 6 EBI software engineers including weekly planning and engineering duties. I manage the long-range planning and decision-making, including creating and owning the de facto Open Targets informatics platform roadmap, and making sure the team adopts the tool and process. This has directly resulted in new features important to Biogen in 2016’s Open Targets informatics platform releases.
4) Biogen’s principal liaison for the and Genomics England discovery forum
Advised Genomics England industrial strategy as part of the GENE consortium together with VPs and Directors from AZ, GSK, Takeda, Abbvie, Alexion and others.
2015-2016
ML Scientist, OpenBiome / Finch Therapeutics Cambridge,MA (remote)
- Statistical analysis of microbiome communities during fecal transplant delivery by capsule
- Project consulting for large pharma organization to analyze prevalence of drug resistant genes in the general population
- Retrospective analysis of Phase 2b trial results and additional microbiome sequencing to measure efficacy
2014-2015
Chief Scientist, Klappo/um.ai, London,UK
- Building a recommendations engine to craft context-aware suggestions for every meal.
- Normalization and concept mapping used to organize natural language data related to food onto a structured ontology
- Data analysis and visualization
- management team responsible for the business plan and investor relations
- medical and scientific direction
2014
Junior Doctor, Imperial College NHS Trust,North West London, UK
- General Practice
- Care of Elderly, orthopaedic outreach
- Anaesthetics & ITU
- Gastroenterology & Acute medical take
2013
Theoretical System Biology group, Prof. M. Stumpf, Imperial College
- Integrating Chip-seq, RNAseq and transcriptomics data describing nitrogen stress response in e.coli
- Organized query with a relational db, structured ontology and visualization
- Bayesian model selection of potential e.coli nitrogen stress pathways
2012
Consultant, SERES Health, Cambridge, MA (remote)
- Computationally designed and predicted fitness of synthetic microbial communities intended for therapeutic transplantation using unsupervised/supervised learning..
- Provided strategic input and scientific advice.
2009-2013
Alm Laboratory for Microbiology, Prof. Eric J. Alm, MIT
- Supervised classification of microbiome samples from pediatric IBD patients
- Feature extraction using ecological and phylogenetic information
- Quality control and downstream analysis of Human Microbiome Project sequencing pipeline
2009
Founder, Enumeral diagnostics, Cambridge, MA
- MIT $100K Entrepeneurship competition semifinalist
- Developed high-throughput data acquisition and analysis platforms, scaling the technology to production levels.
- Contributed to the development of the microfluidic platform at the core of the company intellectual property
2006-2009
Laboratory of Hidde L. Ploegh, Whitehead Institute, MIT
- Unsupervised learning to map affinity and isotype of secreted antibodies in individual primary B cells.
- Describing immune response progression by statistical modelling of the B cell population
- Created a cluster pipeline to automate image analysis of fluorescence microscopy data
- Murine antibody cloning and expression; fluorescence tagging
- Real time fluorescence microcopy; advanced image analysis
2004-2005
Biomedical Nanotechnology Group, Prof. W C. Chan, University of Toronto
- Nanoparticles cytotoxicity
- Quantum Dots characterization (TEM, Absorption, PL, X-IRD)
- Real time fluorescence microscopy, single molecule spectroscopy and biophysics.
2003
Biomaterials Group, Prof. M.C.Tanzi, Politecnico di Milano, Italy
- Morphological, mechanical and functional characterization of polyurethane scaffolds for tissue engineering applications.
Education
2013
MBBS, Imperial College London
Medicine & Surgery
2006-2012
Ph.D, Harvard Medical School/Massachusets Institute of Technology
- Medical Engineering & Medical Physics, Harvard/MIT HST Institute
- Thesis: High-throughput experimental and computational tools for exploring immunity and the microbiome
2008
Sc.M., Massachusets Institute of Technology
Mechanical Engineering
2005
BASc (Honors), University of Toronto
Engineering Science, Biomedical Option
Computing
Python - pytorch, DGL, DGL-KE, numpy, vega-lite, seaborne, pandas, lxml, spacy, requests.
Cloud ops - Docker, Kubernetes, Google Cloud, Azure cloud
R - reporting packages; ggplot2, knitr, shiny.
Web development - minimal knowledge of react, JSX, MDX, dc.js
Linux - Git, bash/zsh, sge/torque/pbs schedulers
SQL - postgresql, sqlite3, clickhouse
NoSQL - elasticsearch
Product & engineer manager in an agile team.
Contributor to open source projects: open targets , kubernetes, luigi
Fellowships & Awards
2020
Early Oncology AstraZeneca Award - driving innovation, Accelerating novel target discovery
2012
Bursary recipient, Exploring Human Host-Microbiome Interactions in Health and Disease, Wellcome Trust Scientific Conferences
2010-2011
Postgraduate D Scholarship, National Science Engineering Research Council, Canada
2008-2009
Poitras pre-doctoral fellowship
2007
Martino Scholar, Harvard/MIT Health Science Tech. Inst.
2008
Martha Gray Prizes for Excellence in Research, Annual Forum, Harvard/MIT Health Science Tech. Inst.
2008
Competition Semifinalist, MIT 100k Business Plan
2005-2008
Postgraduate M Scholarship, National Science Engineering Research Council, Canada
2004
University of Toronto Life Sciences Award, University of Toronto
2004
NSERC Summer Research Award, National Science Engineering Research Council, Canada
2003
OUA Academic Achievement Award, Ontario, Canada
2003
#2 Canadian Army University Course Undergrad Scholarship,University of Toronto
2002-04
Silver T – academic athletic excellence, University of Toronto
2001
Ontario Scholar, Government of Ontario, Canada
Publications
A list is also available online
Journals
2021
Rozemberczki, B., Bonner, S., Nikolov, A., Ughetto, M., Nilsson, S. and Papa, E., 2021. A Unified View of Relational Deep Learning for Polypharmacy Side Effect, Combination Synergy, and Drug-Drug Interaction Prediction. arXiv preprint arXiv:2111.02916.
2021
Rozemberczki, B., Gogleva, A., Nilsson, S., Edwards, G., Nikolov, A., & Papa, E. (2021). MOOMIN: Deep Molecular Omics Network for Anti-Cancer Drug Combination Therapy. arXiv preprint arXiv:2110.15087.
2021
Gogleva, A., papa, E., Jansson, E., & De Baets, G. (2021, September). Drug Discovery as a Recommendation Problem: Challenges and Complexities in Biological Decisions. In Fifteenth ACM Conference on Recommender Systems (pp. 548-550).
2021
Mountjoy E, Schmidt EM, Carmona M, et al. An open approach to systematically prioritize causal variants and genes at all published human GWAS trait-associated loci. Nat Genet. 2021;53(11):1527-1533. doi:10.1038/s41588-021-00945-5
2021
Gogleva, A., Polychronopoulos, D., Pfeifer, M., Poroshin, V., Ughetto, M., Sidders, B., Ahdesmäki, M., McDermott, U., Papa, E., & Bulusu, K.C. (2021). Knowledge Graph-based Recommendation Framework Identifies Novel Drivers of Resistance in EGFR mutant Non-small Cell Lung Cancer. bioRxiv.
2021
Ochoa, D., Hercules, A., Carmona, M., Suveges, D., Gonzalez-Uriarte, A., Malangone, C., Miranda, A., Fumis, L., Carvalho-Silva, D., Spitzer, M., Baker, J., Ferrer, J., Raies, A., Razuvayevskaya, O., Faulconbridge, A., Petsalaki, E., Mutowo, P., Machlitt-Northen, S., Peat, G., McAuley, E., Ong, C.K., Mountjoy, E., Ghoussaini, M., Pierleoni, A., Papa, E., Pignatelli, M., Koscielny, G., Karim, M., Schwartzentruber, J., Hulcoop, D.G., Dunham, I., & McDonagh, E.M. (2021). Open Targets Platform: supporting systematic drug-target identification and prioritisation. Nucleic acids research.
2021
Ghoussaini, M., Mountjoy, E., Carmona, M., Peat, G., Schmidt, E., Hercules, A., Fumis, L., Miranda, A., Carvalho-Silva, D., Buniello, A., Burdett, T., Hayhurst, J.D., Baker, J., Ferrer, J., Gonzalez-Uriarte, A., Jupp, S., Karim, M., Koscielny, G., Machlitt-Northen, S., Malangone, C., Pendlington, Z.M., Roncaglia, P., Suveges, D., Wright, D., Vrousgou, O., Papa, E., Parkinson, H., MacArthur, J.A., Todd, J., Barrett, J., Schwartzentruber, J., Hulcoop, D.G., Ochoa, D., McDonagh, E.M., & Dunham, I. (2021). Open Targets Genetics: systematic identification of trait-associated genes using large-scale genetics and functional genomics. Nucleic acids research.
2020
Jackson, R.J., Jansson, E., Lagerberg, A., Ford, E., Poroshin, V., Scrivener, T., Axelsson, M., Johansson, M., Franco, L.A., & Papa, E. (2020). Ablations over transformer models for biomedical relationship extraction. F1000Research, 9, 710.
2018
Carvalho-Silva, D., Pierleoni, A., Pignatelli, M., Ong, C.K., Fumis, L., Karamanis, N., Carmona, M., Faulconbridge, A., Hercules, A., McAuley, E., Miranda, A., Peat, G., Spitzer, M., Barrett, J., Hulcoop, D.G., Papa, E., Koscielny, G., & Dunham, I. (2019). Open Targets Platform: new developments and updates two years on. Nucleic Acids Research, 47, D1056 - D1065.
2017
Koscielny G, An P, Carvalho-Silva D, Cham JA, Fumis L, Gasparyan R, Hasan S, Karamanis N, Maguire M, Papa E, Pierleoni A, Pignatelli M, Platt T […] Birney E, Barrett J, Dunham I.
Open Targets: a platform for therapeutic target identification and validation. Nucleic Acids Res. 2017 Jan 4;45(D1):D985-D994.
doi: 10.1093/nar/gkw1055. PubMed PMID: 27899665; PubMed Central PMCID:
PMC5210543.
2017
Fischer M, Bittar M, Papa E, Kassam Z, Smith M.
Can You Cause Inflammatory Bowel Disease with Fecal Transplantation?
Gut Microbes 2017 Jan 19:0. doi: 10.1080/19490976.2017.1283469
2012
Eliseo Papa, Michael Docktor, Christopher Smillie, Sarah Weber, Sarah Pacocha Preheim,Dirk Gevers,Georgia Giannoukos, Dawn Ciulla, Diana Tabbaa, Jay Ingram, David B Schauer, Doyle V Ward, Joshua R Korzenik, Ramnik J Xavier, Athos Bousvaros, Eric J Alm.
Non-invasive mapping of the gastrointestinal microbiota identifies children with inflammatory bowel disease.
PLoS ONE 2012;7(6):e39242.
2011
Rhiannon White, Sachiko Miyata, Eliseo Papa, Eric Spooner, Kleoniki Gounaris, Murray Selkirk, Katerina Artavanis-Tsakonas.
Characterisation of the Trichinella spiralis deubiquitinating enzyme, TsUCH37, an evolutionarily conserved proteasome interaction partner.
PLoS Negl Trop Dis. 2011 Oct;5(10):e1340.
2011
Katerina Artavanis-Tsakonas, Pia V Kasperkovitz, Eliseo Papa, Michael L Cardenas, Nida S Khan, Annemarthe G Van der Veen, Hidde L Ploegh and Jatin M Vyas.
The Tetraspanin CD82 is Specifically Recruited to Fungal and Bacterial Phagosomes Prior to Acidification.
Infection and Immunity 2011 79(3):1098-106\
2009
Adebola Ogunniyi, Craig Story, Eliseo Papa, Eduardo Guillen, J. Christopher Love.
Screening Individual Hybridomas by Microengraving to Discover Monoclonal Antibodies.
Nature Protocols 2009 4(5):767-82
2009
Jehnna L. Ronan, Craig Story, Eliseo Papa, J. Christopher Love.
Optimization of the surfaces used to capture antibodies from single hybridomas reduces the time required for microengraving.
Journal of Immunological Methods 2009, 340(2):164-9\
2008
Craig Story*, Eliseo Papa* (co-author), Chih-Chi Andrew Hu, Jehnna L Ronan, Hidde L Ploegh, J.Christopher Love.
Profiling Antibody Responses by Multiparametric Analysis of Single B Cells.
PNAS 2008 105(46):17902-7
2005
Hans Fischer, Eli Papa, Lichuan Liu, K. Sandy Pang, Warren C. W. Chan.
Preliminary Results: Exploring the Interactions of Quantum Dots with Whole Blood Components.
SPIE Proceedings 2005 5969,54
2004
Wen Jiang, Eli Papa, Hans Fischer, Sawitri Mardyani, Warren C.W. Chan.
Semiconductor quantum dots as contrast agents for whole animal imaging.
Trends in Biotechnology 2004 22:12
Posters
2016
Eliseo Papa, Andrea Pierleoni, Gautier Koscielny, Naruemon Pratanwanich, Oliver Stegle, Jeffrey Barrett, Ian Dunham, and the Open Targets Team
Open Targets : a platform for therapeutic target identification and validation
MASAMB 2016
2016
Miguel Pignatelli, Denise Carvalho-Silva, Jennifer Cham, Luca Fumis, Samiul Hasan,
Nikiforos Karamanis, Gautier Koscielny , Michael Maguire, William Newell, Eliseo Papa, Andrea Pierleoni, Francis Rowland, Sirarat Sarntivijal, Oliver Stegle, Jessica Vamathevan, Ewan Birney, Jeffrey Barrett, Ian Dunham and the Open Targets team
Target Validation Platform: A New Drug Target Identification and Prioritisation Web Platform
Genome informatics 2016
2016
Monika Fischer, Mohamad Bittar, Eliseo Papa, Zain Kassam, Mark Smith
Can you cause IBD with fecal transplantation?: 31-patient case series of fecal transplantation using stool from a donor who later developed Crohn’s disease
Digestive Disease Week 2016
2016
Jessica Allegretti, Monika Fischer, Eliseo Papa, Ryan Elliot, M. Jane Klank, Gina Mendolia, Elaine Vo, Zain Kassam, Mark Smith
Fecal Microbiota Transplantation Delivered via Oral Capsules Achieves Microbial Engraftment Similar to Traditional
Delivery Modalities: Safety, Efficacy and Engraftment Results From a Multi-Center Cluster Randomized Dose-Finding Study
Digestive Disease Week 2016
2012
White RR, Morrow M, Miyata S, Papa E, Spooner E, Selkirk M, Gounaris K, Das C, Artavanis-Tsakonas K
Characterisation of the Trichinella Spiralis Deubiquitinating Enzyme, TsUCH37
Molecular and Cellular Biology of Helminth Parasites VII
2012
Eliseo Papa, Michael Docktor, Christopher Smillie, Sarah Weber, Sarah P. Preheim, Dirk Gevers, Georgia Giannoukos, Dawn Ciulla, Diana Tabbaa, Jay Ingram, David B Schauer, Doyle V Ward, Joshua R Korzenik, Ramnik J Xavier, Athos Bousvaros, Eric J Alm.
Diagnosing IBD from the fecal microbiome
Exploring Human Host-Microbiome Interactions in Health and Disease, Wellcome Trust Scientific Conferences
2008
High-Throughput and High-Content Screening of Antibody Responses from Single Cells
AIChE annual meeting, Nanoscale Science Engineering Forum
2008
Applying Ligands to B Cell Receptors by Microfluidics
AIChE annual meeting, Engineering Fundamentals in Life Sciences
2008
Microengraving for high-throughput affinity mapping of humoral responses
Harvard/MIT HST Forum
2008
Multi-variate profiling of B cell immune responses
Novartis Vaccine Symposium
Patents
2009
Composition of an Array of Microwells with an Integrated Microfluidic System, US8569046B2
Volunteering
2005
Field Operative, AISPO, San Raffaele del Monte Tabor Foundation. Milan, Italy
- Streamlined diagnostic routines, Kampala’s Hospital & Gulu’s outpost, Uganda
- Consulted regarding the infrastructure, human resources and logistics of the Kampala’s hospital
2004
Engineers Without Borders.
- University of Toronto Conference delegate