rcosta homepage

Computational Systems (Biotechnology & Engineering) – CoSBioE

R Costa

Affiliation: LAQV-REQUIMTE, DQ, Faculty of Science and Technology, NOVA University Lisbon

Short Bio: Rafael Costa holds a PhD degree in Chemical and Biological Engineering. Currently, he is an Assistant Professor at NOVA-FCT, a Researcher at LAQV (Systems Biology & Engineering Lab) and Collaborator member at Center of Intelligent Systems (CIS), IST-UL. His research focuses on systems biology modeling and data science with application in biotechnology, (bio)processes and medicine.

Research Interests

The main topic of my PhD was in metabolic modeling, control and optimization of biological systems — a focus that continues to drive my research. Since my PhD, I have been interested on the development and application of computational approaches, algorithms, models, and databases to Systems Biology and Biotechnology, particularly for system level analysis of microbial metabolism. To this end, I employ different modeling approaches including ODE-based, constraint-based, data-driven and hybrid approaches in close collaboration with experimental groups. I also belong to a consortium of researchers interested in whole-cell modelling using openly available softwares and standards. More recently, my research activities has expanded and I developed an interest in the data science field, with emphasis in systems engineering techniques (such as machine learning algorithms and data mining) for (bio)chemical & biomedical data analysis and hybrid mechanistic/ML methods, in order to improve outcomes and better decision-making. Moreover, I am currently involving on the regular maintenance of KiMoSys, a database for quantitative kinetic models, and IPOscore, a web-based platform for assisting clinical decisions in the oncology domain.

Key-terms of research: Systems Biology modeling, optimization, industrial biotechnology, (bio)process&systems engineering, bioinformatics, metabolism, (bio)process digitalization, computational medicine&biology, medical systems biology, systems biotechnology, data management, Bioindustry 4.0, metabolic engineering, data analytics, bioengineering

Main Topics

We are working on computational solutions to address biotechnological, (bio)chemical and biomedical problems. Our work is highly interdisciplinary (at the interface of bioengineering, computer science, biotechnology, medicine, chemistry and biology), focused at:

☛ Mathematical models of microbial metabolism for sustainable biotechnological applications ([P1], [P6], [P9], [P10]).

☛ Modeling, optimization & automation of bioprocesses and biosystems ([P3], [P4], [P8], [P11]).

☛ Computational tools and methods (machine learning and data mining) to analyse big-datasets coming from diverse sources (e.g., (bio)chemical data [P2], environmental [P7] and clinical/biomedical data [P5]).

Interested in applying computational models to biotechnological and biomedical challenges? Please Contact me

Selected Publications

  • [P1] J. Pinto, J. Ramos, R. S. Costa & R. Oliveira, A General Hybrid Modeling Framework for Systems Biology Applications: Combining Mechanistic Knowledge with Deep Neural Networks under the SBML Standard. AI, 4(1), 303-318, (2023), doi: 10.3390/ai4010014
  • [P2] Alexandre, L., Costa, R.S. & Henriques, R. DI2: prior-free and multi-item discretization of biological data and its applications. BMC Bioinformatics, 22, 426 (2021), doi: 10.1186/s12859-021-04329-8, CODE
  • [P3] Gonçalves, D., Henriques, R., Costa, R.S. Predicting metabolic fluxes from omics data via machine learning: Moving from knowledge-driven towards data-driven approaches. Computational and Structural Biotechnology Journal, 21, 4960-4973, (2023), doi: 10.1016/j.csbj.2023.10.002, CODE
  • [P4] J. Pinto, J. Ramos, R. S. Costa & R. Oliveira. Hybrid Deep Modeling of a GS115 (Mut+) Pichia pastoris Culture with State–Space Reduction. Fermentation, 9(7), 643, (2023), doi: 10.3390/fermentation9070643
  • [P5] Alexandre, L., Costa, R.S., Santos, L. L., Henriques, R. Mining pre-surgical patterns able to discriminate post-surgical outcomes in the oncological domain. IEEE Journal of Biomedical and Health Informatics, 25(7), (2021), doi: 10.1109/JBHI.2021.3064786
  • [P6] Mochao, H., Barahona, P., Costa, R.S. KiMoSys 2.0: an upgraded database for submitting, storing and accessing experimental data for kinetic modeling. Database-The journal of biological database & curation, baaa093, (2020), doi: 10.1093/database/baaa093
  • [P7] Patrício, A., Lopes, M. B., Costa, P.R., Costa, R.S., Henriques, R., Vinga S. Time-Lagged Correlation Analysis of Shellfish Toxicity Reveals Predictive Links to Adjacent Areas, Species, and Environmental Conditions. Toxins, 14 (10), 679, (2022), doi: 10.3390/toxins14100679, CODE
  • [P8] J. Pinto, M. Mestre, J. Ramos, R. S. Costa, G. Striedner, R. Oliveira A general deep hybrid model for bioreactor systems: combining First Principles with deep neural networks. Computers & Chemical Engineering. 165, 107952, (2022), doi: 10.1016/j.compchemeng.2022.107952
  • [P9] Costa, R. S., Hartmann, A., Gaspar, P., Neves, A. R. and Vinga, S. An extended dynamic model of Lactococcus lactis metabolism for mannitol and 2,3-butanediol production. Molecular BioSystems 10, 628-639 (2014), doi: C3MB70265K, SBML MODEL
  • [P10] Machado, D., Costa, R.S., Rocha, I., Tidor B., Ferreira, E.C. Exploring the gap between dynamic and constraint-based models of metabolism. Metabolic Engineering 14 (2), 112-119 (2012), doi: 10.1016/j.ymben.2012.01.003 (Journal Cover Paper)
  • [P11] J. Pinto †, R. S. Costa* †, L. Alexandre, J. Ramos, R. Oliveira. SBML2HYB: a Python interface for SBML compatible hybrid modeling. Bioinformatics. 39(1), btad044, (2023), doi: bioinformatics/btad044, CODE

Full list of publications: Google Scholar - ORCID - ResearcherID

Representative Research Projects

BioLaMerProof of principle fly larvae biorefinery for biopolymer plastic production (HORIZON-EIC), Coordinator: Sibu Padmanabhan. (ongoing)

IPOscorePredicting the risk of complications of surgical treatment and define prognosis of cancer patients through clinical and biopathological data integration, Data Science and Artificial Intelligence in Public Administration project grant, (FCT). Role: Principal Investigator.

PROMICONHarnessing the power of Nature through productive microbial consortia in biotechnology – measure, model, master (H2020-FNR-12-2020), Coordinator: Jens Kromer. (ongoing)

BacHBerryBACterial Hosts for production of Bioactive phenolics from bERRY fruits (FP7-KBBE.2013.3.1-01), Role: Team member, PI: Jochen Forster.

Software & Tools

DISA, a software package in Python capable of assessing patterns with numerical outputs by statistically testing the correlation gain of the subspace against the overall space. To illustrate the DISA properties, a dataset that monitors the concentration of key enzymes observed in two Design-Build-Test-Learn (DBTL) cycles of 1-dodecanol production in E. coli is presented.

score4Covid, a clinical decision support system for Covid-19 to anticipate medical needs and infection outcomes using demographic and comorbidity variables, as well as onset date of symptoms, test and hospitalization.

DI2 tool, a python package that uses non-parametric tests to find the best fitting distribution for a given variable and discretize it accordingly. To illustrate the DI2 tool properties two case studies were analyzed including a yeast experimental dataset.

KiMoSys database. An experimental data repository for Kinetic Models of biological SYStems. Available at kimosys.org

ObjComparison. Matlab implementation for investigation the effect of cellular objective function and constraints in flux balance constraint-based models. Implementation of all objective functions and constraints can be adapted to test different metabolic systems. Available at http://bit.ly/17QACEW

SON-EM. An algorithm for hybrid system identification of Switched affine AutoRegressive model with eXogenous inputs (SARX) models using Sum of Norms regularization and Expectation Maximization. Available at http://bit.ly/1DaGaVS

SBML2HYB, a Python tool for SBML compatible hybrid modelling of biological systems. Two case studies illustrate the use of SBML2HYB: threonine synthesis pathway model and Park and Ramirez model.

TriSig, a method designed to evaluate the statistical significance of patterns in tensor data. To demonstrate the properties of TriSig, a industrial penicillin batches dataset and a metabolomics dataset from urinary samples analyzed by NMR spectroscop are used.

HYBpy, a web-based framework for hybrid modeling of bioprocesses and biological systems.

Group Members

PhD Thesis IN PROGRESS
-Leonardo Alexandre ID 94054, Co-supervised with Rui Henriques (INESC-ID/IST).
-Daniel Gonçalves ID 194007, Co-supervised with Rui Henriques (INESC-ID/IST) and José A. Ferreira (IPO-Porto).
-André Patrício ID 187631, Co-supervised with Rui Henriques (INESC-ID/IST).
-Samuel Barbosa ID 202302286, Co-supervised with Lúcio L. Santos (IPO-Porto) and Daniel M. Gonçalves (UP).
-Monesh Thirugnanasambandam ID, Co-supervised with Rui Oliveira (NOVA-FCT)

MSc Thesis IN PROGRESS
-André Matos ID 55358, Desenvolvimento de uma base dados da estrutura espacial em Biofilmes. Co-supervised with Pedro Barahona (NOVA-FCT) and Nuno Azevedo (LEPABE-FEUP).
-José Pedreira ID 61835 , HYBpy: a Python tool for hybrid modeling of bioprocesses. Co-supervised with Pedro Barahona (FCT-NOVA) and Rui Oliveira (NOVA-FCT).
-Maria do Carmo Ribeiro ID 96436, Predicting cardiomyocytes content using data-driven approaches. Co-supervised with Duarte Valério (IST).
-Henrique Anjos ID , Prediction of structural heart disease risk. Co-supervised with Rui Henriques (INESC-ID).
-Nuno Bragança ID 68831, “Knowledge transfer for biopharmaceutical production: data analysis of different process development campaigns”, Co-supervised with Rui Oliveira (NOVA-FCT)
-Ekaterina Moskovkina ID 68824, Co-supervised wth Rui Oliveira (NOVA-FCT)

Recent Teaching Activities

[2024-present] – Bioprocess Engineering (Spring semester), Master in Chemical and Biological Engineering;
[2022-present] – Synthetic and Systems Biology (Spring semester), Master in Biotechnology;
[2021-present] – Numerical Methods Applied to Chemical and Biological Engineering (Fall semester), Master in Chemical and Biological Engineering;
[2021-present] – Systems Biology (Spring semester), Master in Computational Biology and Bioinformatics;


© R Costa 2024
https://www.fct.unl.pt/