In the same section
- Polytech
- 3BIO
- BioInfo
- Thesis and Intership subjects
-
Share this page
Master Thesis subjects proposed by 3BIO-BioInfo Computational Biology and Bioinformatics
Our lab
Computational biology and bioinformatics is a field that permeates through all levels of science, from fundamental research to technology development and application. Our lab focuses on the development of bioinformatic tools and computational approaches that can be used both for investigating fundamental questions in biology and designing practical solutions particularly in biotechnology and medicine.
The methods used to build these tools are varied but mainly revolve around artificial intelligence, machine learning, data analysis, statistical physics, stochastic modelling and quantum chemistry. Experimental data is of paramount importance for most of these methods and we have therefore established strong connections with experimental labs from both academia and the industry. Moreover, our lab is part of a network of collaborators within the ULB-VUB (IB)2 Interuniversity Institute of Bioinformatics in Brussels.
The bulk of our work consists in developing a suite of tools that enables the prediction of the changes caused by mutations on the biophysics of proteins (stability, solubility, binding affinity, …) and DNA (ionisation potential). These tools can be applied to study fundamental questions such as the identification and modelling of the mechanisms of natural selection, as well as bringing solutions to the design and optimisation of enzymes used in biotechnological processes or the identification of deleterious variants in diseases like cancer and autism.
Furthermore, we are developing tools for the prediction of epitopes in antigens, which can be applied to vaccine and antibody design, and the understanding of the mechanisms behind certain cancers, auto-immune diseases and allergies.
Finally, we are also working on the mathematical modelling of complex biological systems in order to gain insight, for example, on the role of noisy interactions in systems like genetic circuits or chemical reaction networks or to study how known physiological pathways are perturbed in infection and diseases.
.Schematic representation of our lab’s topics of research (blue) according to the methods used (red) and their relevant fields of study (green).
Topics
1. Rational design of modified proteins [ Fabrizio Pucci & Marianne Rooman ]
Proteins carry out a large variety of biological functions in living organisms. Some proteins, for example, act as particularly specific and efficient catalysts. The possibility to exploit the functional properties of proteins in industrial applications (food industry, chemical processes, pharmaceutical developments, etc.) is extremely interesting; however, one of its major limitations is that proteins generally lose their stability and activity under non-physiological conditions. The ability to design modified proteins that remain structured and active at higher (or lower) temperatures is therefore an important objective of research. Moreover, the understanding of how changes in the coding sequence of proteins affect their biophysical properties is essential in fundamental research such as molecular evolution and phylogeny. This project consists in developing efficient and fast computational tools to predict changes in stability, solubility or interactions of proteins upon mutations, which are applicable on a genome-wide scale. This software will be applied to rationally modify proteins of industrial interest and increasing their efficiency within a given application, or to study fundamental questions of interest in biophysics and molecular evolution. This project can be done within our group only, or in collaboration with experimental laboratories in academia or industry where the predictions will be tested.
2. Prediction of disease-causing variants in the human genome [ Fabrizio Pucci & Marianne Rooman ]
Next Generation Sequencing technologies produces massive amounts of genome data that are revolutionizing biological and medical research, and paves the way towards personalized medicine. Among the exome variants that lead to amino acid mutations, most are neutral in the sense that they only modify the individual’s phenotype, but some are the cause of diseases. The identification of deleterious mutations and their characterization are of prime importance for setting up personalized therapies. This project consists in developing and applying bioinformatics tools to predict disease-causing protein variants and trying to understand why they are so, in terms of protein characteristics such as stability, solubility, flexibility and function, or in terms of DNA characteristics such as its ionisation potential. More specifically, we are currently characterizing, at the molecular level, mutations involved in rare diseases, cancer and neurodevelopmental disorders such as autism spectrum disorder. The project is in collaboration with geneticists and clinicians (e.g., Hôpital Erasme, BE, Dana Farber Cancer Institute, USA, University of California San Diego School of Medicine, USA).
3. Monoclonal Antibody design [Fabrizio Pucci & Marianne Rooman]
4. Mathematical modeling of the Renin-Angiotensin System [Marianne Rooman & Fabrizio Pucci]
Synthetic biology is a relatively new field whose focus is on improving already existing and engineering new gene circuits. In order to implement a new gene circuit one must understand its behavior in terms of the system parameters. This requires that we understand not only the system's averaged deterministic behavior, but also how random variations, i.e. internal and external noise, affect its performance. This project consists in modeling systems of increasing complexity, using stochastic differential equations, and analyzing their noise levels defined as the variance of the number of molecules. We will investigate models of various biological systems, such as prokaryotic and eukaryotic gene regulation and protein oligomerization, and attempt to understand the general relationship between noise, complexity, and some key characteristics of the systems.
5. Artificial intelligence methods to design ligands for olfactory receptors [Dimitri Gilis]
The olfactory system relies on protein receptors expressed by olfactory neurons. These olfactory receptors belong to the family of G protein-coupled membrane receptors (GPCR). The relationships between odorant molecules, targeted olfactory receptors and odour perception are complex and not yet well understood. In addition, it has been shown that some olfactory receptors are expressed in tissues other than the olfactory epithelium and may have a physiological or potentially therapeutic role.
This project consists in developing artificial intelligence approaches, allowing (1) to predict the olfactory receptor(s) targeted by an odorant molecule, and (2) to design de novo a molecule able to activate a given olfactory receptor. It is carried out in collaboration with the group of Prof. I. Langer (Faculty of Medicine), which experimentally characterises these systems.
The master thesis topics related to this project can be entirely bioinformatics or include an experimental part.
6. Food and house dust mite allergens [Dimitri Gilis]
Allergy represents an important public health problem. On the one hand, we are developing bioinformatics tools to predict whether a protein corresponds to a food allergen. Such tools are very important for the development of new food products. On the other hand, we are studying certain structural and dynamic properties of house dust mite allergens.
7. Leveraging deep learning model for biomolecule [Fabrizio Pucci & Marianne Rooman & Dimitri Gilis]
Deep Learning algorithms are revolutionizing biomolecular science. For example, the AlphaFold2 and RosettaTR algorithms recently succeeded in pushing the accuracy of 3D protein structure prediction close to that of experimental methods. In this line of research, we will explore complex deep learning architectures to predict targeted biophysical characteristics of proteins. We propose two subtopics. In the first, pretrained attention-based language models (LM), such as ProteinBert, will be used and fine-tuned on downstream tasks such as protein localization or melting temperature predictions. Depending on the availability of computational resources, we will also consider direct training of BERT-like models which use masked LM approaches. The second subtopic is focused on the exploration of generative models for protein design.
.PDF File
Contacts
Pr. Dimitri Gilis: dgilis@ulb.ac.bePr. Fabrizio Pucci: fapucci@ulb.ac.be
Pr. Marianne Rooman : mrooman@ulb.ac.be