Project acronym BACTERIAL SYRINGES
Project Protein Translocation Through Bacterial Syringes
Researcher (PI) Stefan Raunser
Host Institution (HI) MAX-PLANCK-GESELLSCHAFT ZUR FORDERUNG DER WISSENSCHAFTEN EV
Call Details Consolidator Grant (CoG), LS1, ERC-2013-CoG
Summary "The main objective of this application is to study the molecular basis of cellular infection by bacterial ABC-type toxins (Tc). Tc complexes are important virulence factors of a range of bacteria, including Photorhabdus luminescens and Yersinia pseudotuberculosis that infect insects and humans. Belonging to the class of pore-forming toxins, tripartite Tc complexes perforate the host membrane by forming channels that translocate toxic enzymes into the host.
In our previous cryo-EM work on the P. luminescens Tc complex we discovered that Tcs use a special syringe-like device for cell entry. Building on these results, we now intend to unravel the molecular mechanism through which this unusual and complicated injection system allows membrane permeation and protein translocation. We will use a hybrid approach, including biochemical reconstitution, structural analysis by cryo-EM and X-ray crystallography, fluorescence-based assays and site-directed mutagenesis to provide a comprehensive description of the molecular mechanism of infection at an unprecedented level of molecular detail.
Our results will be paradigmatic for understanding the mechanism of action of ABC-type toxins and will shed new light on the interactions of bacterial pathogens with their hosts."
Summary
"The main objective of this application is to study the molecular basis of cellular infection by bacterial ABC-type toxins (Tc). Tc complexes are important virulence factors of a range of bacteria, including Photorhabdus luminescens and Yersinia pseudotuberculosis that infect insects and humans. Belonging to the class of pore-forming toxins, tripartite Tc complexes perforate the host membrane by forming channels that translocate toxic enzymes into the host.
In our previous cryo-EM work on the P. luminescens Tc complex we discovered that Tcs use a special syringe-like device for cell entry. Building on these results, we now intend to unravel the molecular mechanism through which this unusual and complicated injection system allows membrane permeation and protein translocation. We will use a hybrid approach, including biochemical reconstitution, structural analysis by cryo-EM and X-ray crystallography, fluorescence-based assays and site-directed mutagenesis to provide a comprehensive description of the molecular mechanism of infection at an unprecedented level of molecular detail.
Our results will be paradigmatic for understanding the mechanism of action of ABC-type toxins and will shed new light on the interactions of bacterial pathogens with their hosts."
Max ERC Funding
1 999 992 €
Duration
Start date: 2014-07-01, End date: 2019-06-30
Project acronym BALANCE
Project Mapping Dispersion Spectroscopically in Large Gas-Phase Molecular Ions
Researcher (PI) Peter CHEN
Host Institution (HI) EIDGENOESSISCHE TECHNISCHE HOCHSCHULE ZUERICH
Call Details Advanced Grant (AdG), PE4, ERC-2018-ADG
Summary We use IR spectroscopy of trapped ions in a cryogenic FT-ICR spectrometer to probe non-covalent, “dispersion” interactions in large, gas-phase molecular ions. We will measure conformational equilibria by N-H frequency shifts, and correlate gas-phase IR frequency to the N-H-N bond angle in an ionic H-bond. Substituents on “onium” cations can adopt various conformations, whose energies map interaction potentials. Substituents on their proton-bound dimers interact non-covalently through dispersion forces, whose quantitative evaluation in large molecules has remained difficult despite dispersion becoming increasingly cited as a design principle in the construction of catalysts and materials. The non-covalent interactions bend the N-H-N bond, leading to large shifts in the IR frequency. The proton-bound dimer acts like a molecular balance where the non-covalent interaction, is set against the bending potential in an ionic hydrogen bond. Despite encouragingly accurate calculations for small molecules, experimental benchmarks for large molecules in the gas phase remain scarce, and there is evidence that the good results for small molecules may not extrapolate reliably to large molecules. The present proposal introduces a new experimental probe of non-covalent interactions, providing a sensitive test of the diverging results coming from various computational methods and other experiments. The experiment must be done on isolated molecules in the gas phase, as previous work has shown that solvation substantially cancels out the attractive potential. Accordingly, the proposed experimental design, which involves a custom-built spectrometer, newly available tunable IR sources, chemical synthesis of custom substrates, and quantum calculations up to coupled-cluster levels of theory, showcases how an interdisciplinary approach combining physical and organic chemistry can solve a fundamental problem that impacts how we understand steric effects in organic chemistry.
Summary
We use IR spectroscopy of trapped ions in a cryogenic FT-ICR spectrometer to probe non-covalent, “dispersion” interactions in large, gas-phase molecular ions. We will measure conformational equilibria by N-H frequency shifts, and correlate gas-phase IR frequency to the N-H-N bond angle in an ionic H-bond. Substituents on “onium” cations can adopt various conformations, whose energies map interaction potentials. Substituents on their proton-bound dimers interact non-covalently through dispersion forces, whose quantitative evaluation in large molecules has remained difficult despite dispersion becoming increasingly cited as a design principle in the construction of catalysts and materials. The non-covalent interactions bend the N-H-N bond, leading to large shifts in the IR frequency. The proton-bound dimer acts like a molecular balance where the non-covalent interaction, is set against the bending potential in an ionic hydrogen bond. Despite encouragingly accurate calculations for small molecules, experimental benchmarks for large molecules in the gas phase remain scarce, and there is evidence that the good results for small molecules may not extrapolate reliably to large molecules. The present proposal introduces a new experimental probe of non-covalent interactions, providing a sensitive test of the diverging results coming from various computational methods and other experiments. The experiment must be done on isolated molecules in the gas phase, as previous work has shown that solvation substantially cancels out the attractive potential. Accordingly, the proposed experimental design, which involves a custom-built spectrometer, newly available tunable IR sources, chemical synthesis of custom substrates, and quantum calculations up to coupled-cluster levels of theory, showcases how an interdisciplinary approach combining physical and organic chemistry can solve a fundamental problem that impacts how we understand steric effects in organic chemistry.
Max ERC Funding
2 446 125 €
Duration
Start date: 2019-05-01, End date: 2024-04-30
Project acronym BANDWIDTH
Project The cost of limited communication bandwidth in distributed computing
Researcher (PI) Keren CENSOR-HILLEL
Host Institution (HI) TECHNION - ISRAEL INSTITUTE OF TECHNOLOGY
Call Details Starting Grant (StG), PE6, ERC-2017-STG
Summary Distributed systems underlie many modern technologies, a prime example being the Internet. The ever-increasing abundance of distributed systems necessitates their design and usage to be backed by strong theoretical foundations.
A major challenge that distributed systems face is the lack of a central authority, which brings many aspects of uncertainty into the environment, in the form of unknown network topology or unpredictable dynamic behavior. A practical restriction of distributed systems, which is at the heart of this proposal, is the limited bandwidth available for communication between the network components.
A central family of distributed tasks is that of local tasks, which are informally described as tasks which are possible to solve by sending information through only a relatively small number of hops. A cornerstone example is the need to break symmetry and provide a better utilization of resources, which can be obtained by the task of producing a valid coloring of the nodes given some small number of colors. Amazingly, there are still huge gaps between the known upper and lower bounds for the complexity of many local tasks. This holds even if one allows powerful assumptions of unlimited bandwidth. While some known algorithms indeed use small messages, the complexity gaps are even larger compared to the unlimited bandwidth case. This is not a mere coincidence, and in fact the existing theoretical infrastructure is provably incapable of
giving stronger lower bounds for many local tasks under limited bandwidth.
This proposal zooms in on this crucial blind spot in the current literature on the theory of distributed computing, namely, the study of local tasks under limited bandwidth. The goal of this research is to produce fast algorithms for fundamental distributed local tasks under restricted bandwidth, as well as understand their limitations by providing lower bounds.
Summary
Distributed systems underlie many modern technologies, a prime example being the Internet. The ever-increasing abundance of distributed systems necessitates their design and usage to be backed by strong theoretical foundations.
A major challenge that distributed systems face is the lack of a central authority, which brings many aspects of uncertainty into the environment, in the form of unknown network topology or unpredictable dynamic behavior. A practical restriction of distributed systems, which is at the heart of this proposal, is the limited bandwidth available for communication between the network components.
A central family of distributed tasks is that of local tasks, which are informally described as tasks which are possible to solve by sending information through only a relatively small number of hops. A cornerstone example is the need to break symmetry and provide a better utilization of resources, which can be obtained by the task of producing a valid coloring of the nodes given some small number of colors. Amazingly, there are still huge gaps between the known upper and lower bounds for the complexity of many local tasks. This holds even if one allows powerful assumptions of unlimited bandwidth. While some known algorithms indeed use small messages, the complexity gaps are even larger compared to the unlimited bandwidth case. This is not a mere coincidence, and in fact the existing theoretical infrastructure is provably incapable of
giving stronger lower bounds for many local tasks under limited bandwidth.
This proposal zooms in on this crucial blind spot in the current literature on the theory of distributed computing, namely, the study of local tasks under limited bandwidth. The goal of this research is to produce fast algorithms for fundamental distributed local tasks under restricted bandwidth, as well as understand their limitations by providing lower bounds.
Max ERC Funding
1 486 480 €
Duration
Start date: 2018-06-01, End date: 2023-05-31
Project acronym BAS-SBBT
Project Bacterial Amyloid Secretion: Structural Biology and Biotechnology.
Researcher (PI) Han Karel Remaut
Host Institution (HI) VIB
Call Details Consolidator Grant (CoG), LS1, ERC-2014-CoG
Summary Curli are functional amyloid fibers that constitute the major protein component of the extracellular matrix in pellicle biofilms formed by Bacteroidetes and Proteobacteria. Unlike the protein misfolding and aggregation events seen in pathological amyloid diseases such as Alzheimer’s and Parkinson’s disease, curli are the product of a dedicated protein secretion machinery. Curli formation requires a specialised and mechanistically unique transporter in the bacterial outer membrane, as well as two soluble accessory proteins thought to facilitate the safe guidance of the curli subunits across the periplasm and to coordinate their self-assembly at cell surface.
In this interdisciplinary research program we will study the structural and molecular biology of E. coli curli biosynthesis and address the fundamental questions concerning the molecular processes that allow the spatially and temporally controlled transport and deposition of these pro-amyloidogenic polypeptides. We will structurally unravel the secretion machinery, trap and analyse critical secretion intermediates and through in vitro reconstitution, assemble a minimal, self-sufficient peptide transport and fiber assembly system.
The new insights gained will set the stage for targeted interventions in curli -mediated biofilm formation and this research project will develop a new framework to harness the unique properties found in curli structure and biosynthesis for biotechnological applications as in patterned functionalized nanowires and directed, selective peptide carriers.
Summary
Curli are functional amyloid fibers that constitute the major protein component of the extracellular matrix in pellicle biofilms formed by Bacteroidetes and Proteobacteria. Unlike the protein misfolding and aggregation events seen in pathological amyloid diseases such as Alzheimer’s and Parkinson’s disease, curli are the product of a dedicated protein secretion machinery. Curli formation requires a specialised and mechanistically unique transporter in the bacterial outer membrane, as well as two soluble accessory proteins thought to facilitate the safe guidance of the curli subunits across the periplasm and to coordinate their self-assembly at cell surface.
In this interdisciplinary research program we will study the structural and molecular biology of E. coli curli biosynthesis and address the fundamental questions concerning the molecular processes that allow the spatially and temporally controlled transport and deposition of these pro-amyloidogenic polypeptides. We will structurally unravel the secretion machinery, trap and analyse critical secretion intermediates and through in vitro reconstitution, assemble a minimal, self-sufficient peptide transport and fiber assembly system.
The new insights gained will set the stage for targeted interventions in curli -mediated biofilm formation and this research project will develop a new framework to harness the unique properties found in curli structure and biosynthesis for biotechnological applications as in patterned functionalized nanowires and directed, selective peptide carriers.
Max ERC Funding
1 989 489 €
Duration
Start date: 2015-06-01, End date: 2020-05-31
Project acronym BASTION
Project Leveraging Binary Analysis to Secure the Internet of Things
Researcher (PI) Thorsten Holz
Host Institution (HI) RUHR-UNIVERSITAET BOCHUM
Call Details Starting Grant (StG), PE6, ERC-2014-STG
Summary We are in the midst of the shift towards the Internet of Things (IoT), where more and more (legacy) devices are connected to the Internet and communicate with each other. This paradigm shift brings new security challenges and unfortunately many current security solutions are not applicable anymore, e.g., because of a lack of clear network boundaries or resource-constrained devices. However, security plays a central role: In addition to its classical function in protecting against manipulation and fraud, it also enables novel applications and innovative business models.
We propose a research program that leverages binary analysis techniques to improve the security within the IoT. We concentrate on the software level since this enables us to both analyze a given device for potential security vulnerabilities and add security features to harden the device against future attacks. More specifically, we concentrate on the firmware (i.e., the combination of persistent memory together with program code and data that powers such devices) and develop novel mechanism for binary analysis of such software. We design an intermediate language to abstract away from the concrete assembly level and this enables an analysis of many different platforms within a unified analysis framework. We transfer and extend program analysis techniques such as control-/data-flow analysis or symbolic execution and apply them to our IL. Given this novel toolset, we can analyze security properties of a given firmware image (e.g., uncovering undocumented functionality and detecting memory corruption or logical vulnerabilities,). We also explore how to harden a firmware by retrofitting security mechanisms (e.g., adding control-flow integrity or automatically eliminating unnecessary functionality). This research will deepen our fundamental understanding of binary analysis methods and apply it to a novel area as it lays the foundations of performing this analysis on the level of intermediate languages.
Summary
We are in the midst of the shift towards the Internet of Things (IoT), where more and more (legacy) devices are connected to the Internet and communicate with each other. This paradigm shift brings new security challenges and unfortunately many current security solutions are not applicable anymore, e.g., because of a lack of clear network boundaries or resource-constrained devices. However, security plays a central role: In addition to its classical function in protecting against manipulation and fraud, it also enables novel applications and innovative business models.
We propose a research program that leverages binary analysis techniques to improve the security within the IoT. We concentrate on the software level since this enables us to both analyze a given device for potential security vulnerabilities and add security features to harden the device against future attacks. More specifically, we concentrate on the firmware (i.e., the combination of persistent memory together with program code and data that powers such devices) and develop novel mechanism for binary analysis of such software. We design an intermediate language to abstract away from the concrete assembly level and this enables an analysis of many different platforms within a unified analysis framework. We transfer and extend program analysis techniques such as control-/data-flow analysis or symbolic execution and apply them to our IL. Given this novel toolset, we can analyze security properties of a given firmware image (e.g., uncovering undocumented functionality and detecting memory corruption or logical vulnerabilities,). We also explore how to harden a firmware by retrofitting security mechanisms (e.g., adding control-flow integrity or automatically eliminating unnecessary functionality). This research will deepen our fundamental understanding of binary analysis methods and apply it to a novel area as it lays the foundations of performing this analysis on the level of intermediate languages.
Max ERC Funding
1 472 269 €
Duration
Start date: 2015-03-01, End date: 2020-02-29
Project acronym BATNMR
Project Development and Application of New NMR Methods for Studying Interphases and Interfaces in Batteries
Researcher (PI) Clare GREY
Host Institution (HI) THE CHANCELLOR MASTERS AND SCHOLARS OF THE UNIVERSITY OF CAMBRIDGE
Call Details Advanced Grant (AdG), PE4, ERC-2018-ADG
Summary The development of longer lasting, higher energy density and cheaper rechargeable batteries represents one of the major technological challenges of our society, batteries representing the limiting components in the shift from gasoline-powered to electric vehicles. They are also required to enable the use of more (typically intermittent) renewable energy, to balance demand with generation. This proposal seeks to develop and apply new NMR metrologies to determine the structure and dynamics of the multiple electrode-electrolyte interfaces and interphases that are present in these batteries, and how they evolve during battery cycling. New dynamic nuclear polarization (DNP) techniques will be exploited to extract structural information about the interface between the battery electrode and the passivating layers that grow on the electrode materials (the solid electrolyte interphase, SEI) and that are inherent to the stability of the batteries. The role of the SEI (and ceramic interfaces) in controlling lithium metal dendrite growth will be determined in liquid based and all solid state batteries.
New DNP approaches will be developed that are compatible with the heterogeneous and reactive species that are present in conventional, all-solid state, Li-air and redox flow batteries. Method development will run in parallel with the use of DNP approaches to determine the structures of the various battery interfaces and interphases, testing the stability of conventional biradicals in these harsh oxidizing and reducing conditions, modifying the experimental approaches where appropriate. The final result will be a significantly improved understanding of the structures of these phases and how they evolve on cycling, coupled with strategies for designing improved SEI structures. The nature of the interface between a lithium metal dendrite and ceramic composite will be determined, providing much needed insight into how these (unwanted) dendrites grow in all solid state batteries. DNP approaches coupled with electron spin resonance will be use, where possible in situ, to determine the reaction mechanisms of organic molecules such as quinones in organic-based redox flow batteries in order to help prevent degradation of the electrochemically active species.
This proposal involves NMR method development specifically designed to explore a variety of battery chemistries. Thus, this proposal is interdisciplinary, containing both a strong emphasis on materials characterization, electrochemistry and electronic structures of materials, interfaces and nanoparticles, and on analytical and physical chemistry. Some of the methodology will be applicable to other materials and systems including (for example) other electrochemical technologies such as fuel cells and solar fuels and the study of catalysts (to probe surface structure).
Summary
The development of longer lasting, higher energy density and cheaper rechargeable batteries represents one of the major technological challenges of our society, batteries representing the limiting components in the shift from gasoline-powered to electric vehicles. They are also required to enable the use of more (typically intermittent) renewable energy, to balance demand with generation. This proposal seeks to develop and apply new NMR metrologies to determine the structure and dynamics of the multiple electrode-electrolyte interfaces and interphases that are present in these batteries, and how they evolve during battery cycling. New dynamic nuclear polarization (DNP) techniques will be exploited to extract structural information about the interface between the battery electrode and the passivating layers that grow on the electrode materials (the solid electrolyte interphase, SEI) and that are inherent to the stability of the batteries. The role of the SEI (and ceramic interfaces) in controlling lithium metal dendrite growth will be determined in liquid based and all solid state batteries.
New DNP approaches will be developed that are compatible with the heterogeneous and reactive species that are present in conventional, all-solid state, Li-air and redox flow batteries. Method development will run in parallel with the use of DNP approaches to determine the structures of the various battery interfaces and interphases, testing the stability of conventional biradicals in these harsh oxidizing and reducing conditions, modifying the experimental approaches where appropriate. The final result will be a significantly improved understanding of the structures of these phases and how they evolve on cycling, coupled with strategies for designing improved SEI structures. The nature of the interface between a lithium metal dendrite and ceramic composite will be determined, providing much needed insight into how these (unwanted) dendrites grow in all solid state batteries. DNP approaches coupled with electron spin resonance will be use, where possible in situ, to determine the reaction mechanisms of organic molecules such as quinones in organic-based redox flow batteries in order to help prevent degradation of the electrochemically active species.
This proposal involves NMR method development specifically designed to explore a variety of battery chemistries. Thus, this proposal is interdisciplinary, containing both a strong emphasis on materials characterization, electrochemistry and electronic structures of materials, interfaces and nanoparticles, and on analytical and physical chemistry. Some of the methodology will be applicable to other materials and systems including (for example) other electrochemical technologies such as fuel cells and solar fuels and the study of catalysts (to probe surface structure).
Max ERC Funding
3 498 219 €
Duration
Start date: 2019-10-01, End date: 2024-09-30
Project acronym BayCellS
Project A Bayesian Framework for Cellular Structural Biology
Researcher (PI) Michael Nilges
Host Institution (HI) INSTITUT PASTEUR
Call Details Advanced Grant (AdG), LS1, ERC-2011-ADG_20110310
Summary The functioning of a single cell or organism is governed by the laws of chemistry and physics. The bridge from biology to chemistry and physics is provided by structural biology: to understand the functioning of a cell, it is necessary to know the atomic structure of macromolecular assemblies, which may contain hundreds of components. To characterise the structures of the increasingly large and often flexible complexes, high resolution structure determination (as was possible for example for the ribosome) will likely stay the exception, and multiple sources of structural data at multiple resolutions are employed. Integrating these data into one consistent picture poses particular difficulties, since data are much more sparse than in high resolution methods, and the data sets from heterogeneous sources are of highly different and unknown quality and may be mutually inconsistent, and that data are in general averaged over large ensembles and long times. Molecular modelling, a crucial element of any structure determination, plays an even more important role in these multi-scale and multi-technique approaches, not only to obtain structures from the data, but also to evaluate their reliability. This proposal is to develop a consistent framework for this highly complex data integration problem, principally based on Bayesian probability theory. Appropriate models for the major types data types used in hybrid approaches will be developed, as well as representations to include structural knowledge for the components of the complexes, at multiple scales. The new methods will be applied to a series of problems with increasing complexity, going from the determination of protein complexes with high resolution information, over low resolution structures based on protein-protein interaction data such as the nuclear pore, to the genome organisation in the nucleus.
Summary
The functioning of a single cell or organism is governed by the laws of chemistry and physics. The bridge from biology to chemistry and physics is provided by structural biology: to understand the functioning of a cell, it is necessary to know the atomic structure of macromolecular assemblies, which may contain hundreds of components. To characterise the structures of the increasingly large and often flexible complexes, high resolution structure determination (as was possible for example for the ribosome) will likely stay the exception, and multiple sources of structural data at multiple resolutions are employed. Integrating these data into one consistent picture poses particular difficulties, since data are much more sparse than in high resolution methods, and the data sets from heterogeneous sources are of highly different and unknown quality and may be mutually inconsistent, and that data are in general averaged over large ensembles and long times. Molecular modelling, a crucial element of any structure determination, plays an even more important role in these multi-scale and multi-technique approaches, not only to obtain structures from the data, but also to evaluate their reliability. This proposal is to develop a consistent framework for this highly complex data integration problem, principally based on Bayesian probability theory. Appropriate models for the major types data types used in hybrid approaches will be developed, as well as representations to include structural knowledge for the components of the complexes, at multiple scales. The new methods will be applied to a series of problems with increasing complexity, going from the determination of protein complexes with high resolution information, over low resolution structures based on protein-protein interaction data such as the nuclear pore, to the genome organisation in the nucleus.
Max ERC Funding
2 130 212 €
Duration
Start date: 2012-05-01, End date: 2017-04-30
Project acronym BAYES-KNOWLEDGE
Project Effective Bayesian Modelling with Knowledge before Data
Researcher (PI) Norman Fenton
Host Institution (HI) QUEEN MARY UNIVERSITY OF LONDON
Call Details Advanced Grant (AdG), PE6, ERC-2013-ADG
Summary This project aims to improve evidence-based decision-making. What makes it radical is that it plans to do this in situations (common for critical risk assessment problems) where there is little or even no data, and hence where traditional statistics cannot be used. To address this problem Bayesian analysis, which enables domain experts to supplement observed data with subjective probabilities, is normally used. As real-world problems typically involve multiple uncertain variables, Bayesian analysis is extended using a technique called Bayesian networks (BNs). But, despite many great benefits, BNs have been under-exploited, especially in areas where they offer the greatest potential for improvements (law, medicine and systems engineering). This is mainly because of widespread resistance to relying on subjective knowledge. To address this problem much current research assumes sufficient data are available to make the expert’s input minimal or even redundant; with such data it may be possible to ‘learn’ the underlying BN model. But this approach offers nothing when there is limited or no data. Even when ‘big’ data are available the resulting models may be superficially objective but fundamentally flawed as they fail to capture the underlying causal structure that only expert knowledge can provide.
Our solution is to develop a method to systemize the way expert driven causal BN models can be built and used effectively either in the absence of data or as a means of determining what future data is really required. The method involves a new way of framing problems and extensions to BN theory, notation and tools. Working with relevant domain experts, along with cognitive psychologists, our methods will be developed and tested experimentally on real-world critical decision-problems in medicine, law, forensics, and transport. As the work complements current data-driven approaches, it will lead to improved BN modelling both when there is extensive data as well as none.
Summary
This project aims to improve evidence-based decision-making. What makes it radical is that it plans to do this in situations (common for critical risk assessment problems) where there is little or even no data, and hence where traditional statistics cannot be used. To address this problem Bayesian analysis, which enables domain experts to supplement observed data with subjective probabilities, is normally used. As real-world problems typically involve multiple uncertain variables, Bayesian analysis is extended using a technique called Bayesian networks (BNs). But, despite many great benefits, BNs have been under-exploited, especially in areas where they offer the greatest potential for improvements (law, medicine and systems engineering). This is mainly because of widespread resistance to relying on subjective knowledge. To address this problem much current research assumes sufficient data are available to make the expert’s input minimal or even redundant; with such data it may be possible to ‘learn’ the underlying BN model. But this approach offers nothing when there is limited or no data. Even when ‘big’ data are available the resulting models may be superficially objective but fundamentally flawed as they fail to capture the underlying causal structure that only expert knowledge can provide.
Our solution is to develop a method to systemize the way expert driven causal BN models can be built and used effectively either in the absence of data or as a means of determining what future data is really required. The method involves a new way of framing problems and extensions to BN theory, notation and tools. Working with relevant domain experts, along with cognitive psychologists, our methods will be developed and tested experimentally on real-world critical decision-problems in medicine, law, forensics, and transport. As the work complements current data-driven approaches, it will lead to improved BN modelling both when there is extensive data as well as none.
Max ERC Funding
1 572 562 €
Duration
Start date: 2014-04-01, End date: 2018-03-31
Project acronym BDE
Project Beyond Distance Estimates: A New Theory of Heuristics for State-Space Search
Researcher (PI) Malte HELMERT
Host Institution (HI) UNIVERSITAT BASEL
Call Details Consolidator Grant (CoG), PE6, ERC-2018-COG
Summary "Many problems in computer science can be cast as state-space search, where the
objective is to find a path from an initial state to a goal state in a
directed graph called a ""state space"". State-space search is challenging due
to the state explosion problem a.k.a. ""curse of dimensionality"": interesting
state spaces are often astronomically large, defying brute-force exploration.
State-space search has been a core research problem in Artificial Intelligence
since its early days and is alive as ever. Every year, a substantial fraction
of research published at the ICAPS and SoCS conferences is concerned with
state-space search, and the topic is very active at general AI conferences
such as IJCAI and AAAI.
Algorithms in the A* family, dating back to 1968, are still the go-to approach
for state-space search. A* is a graph search algorithm whose only
""intelligence"" stems from a so-called ""heuristic function"", which estimates
the distance from a state to the nearest goal state. The efficiency of A*
depends on the accuracy of this estimate, and decades of research have pushed
the envelope in devising increasingly accurate estimates.
In this project, we question the ""A* + distance estimator"" paradigm and
explore three new directions that go beyond the classical approach:
1. We propose a new paradigm of declarative heuristics, where heuristic
information is not represented as distance estimates, but as properties of
solutions amenable to introspection and general reasoning.
2. We suggest moving the burden of creativity away from the human expert by
casting heuristic design as a meta-optimization problem that can be solved
automatically.
3. We propose abandoning the idea of exploring sequential paths in state
spaces, instead transforming state-space search into combinatorial
optimization problems with no explicit sequencing aspect. We argue that the
""curse of sequentiality"" is as bad as the curse of dimensionality and must
be addressed head-on."
Summary
"Many problems in computer science can be cast as state-space search, where the
objective is to find a path from an initial state to a goal state in a
directed graph called a ""state space"". State-space search is challenging due
to the state explosion problem a.k.a. ""curse of dimensionality"": interesting
state spaces are often astronomically large, defying brute-force exploration.
State-space search has been a core research problem in Artificial Intelligence
since its early days and is alive as ever. Every year, a substantial fraction
of research published at the ICAPS and SoCS conferences is concerned with
state-space search, and the topic is very active at general AI conferences
such as IJCAI and AAAI.
Algorithms in the A* family, dating back to 1968, are still the go-to approach
for state-space search. A* is a graph search algorithm whose only
""intelligence"" stems from a so-called ""heuristic function"", which estimates
the distance from a state to the nearest goal state. The efficiency of A*
depends on the accuracy of this estimate, and decades of research have pushed
the envelope in devising increasingly accurate estimates.
In this project, we question the ""A* + distance estimator"" paradigm and
explore three new directions that go beyond the classical approach:
1. We propose a new paradigm of declarative heuristics, where heuristic
information is not represented as distance estimates, but as properties of
solutions amenable to introspection and general reasoning.
2. We suggest moving the burden of creativity away from the human expert by
casting heuristic design as a meta-optimization problem that can be solved
automatically.
3. We propose abandoning the idea of exploring sequential paths in state
spaces, instead transforming state-space search into combinatorial
optimization problems with no explicit sequencing aspect. We argue that the
""curse of sequentiality"" is as bad as the curse of dimensionality and must
be addressed head-on."
Max ERC Funding
1 997 510 €
Duration
Start date: 2019-02-01, End date: 2024-01-31
Project acronym BeadsOnString
Project Beads on String Genomics: Experimental Toolbox for Unmasking Genetic / Epigenetic Variation in Genomic DNA and Chromatin
Researcher (PI) Yuval Ebenstein
Host Institution (HI) TEL AVIV UNIVERSITY
Call Details Starting Grant (StG), PE4, ERC-2013-StG
Summary Next generation sequencing (NGS) is revolutionizing all fields of biological research but it fails to extract the full range of information associated with genetic material and is lacking in its ability to resolve variations between genomes. The high degree of genome variation exhibited both on the population level as well as between genetically “identical” cells (even in the same organ) makes genetic and epigenetic analysis on the single cell and single genome level a necessity.
Chromosomes may be conceptually represented as a linear one-dimensional barcode. However, in contrast to a traditional binary barcode approach that considers only two possible bits of information (1 & 0), I will use colour and molecular structure to expand the variety of information represented in the barcode. Like colourful beads threaded on a string, where each bead represents a distinct type of observable, I will label each type of genomic information with a different chemical moiety thus expanding the repertoire of information that can be simultaneously measured. A major effort in this proposal is invested in the development of unique chemistries to enable this labelling.
I specifically address three types of genomic variation: Variations in genomic layout (including DNA repeats, structural and copy number variations), variations in the patterns of chemical DNA modifications (such as methylation of cytosine bases) and variations in the chromatin composition (including nucleosome and transcription factor distributions). I will use physical extension of long DNA molecules on surfaces and in nanofluidic channels to reveal this information visually in the form of a linear, fluorescent “barcode” that is read-out by advanced imaging techniques. Similarly, DNA molecules will be threaded through a nanopore where the sequential position of “bulky” molecular groups attached to the DNA may be inferred from temporal modulation of an ionic current measured across the pore.
Summary
Next generation sequencing (NGS) is revolutionizing all fields of biological research but it fails to extract the full range of information associated with genetic material and is lacking in its ability to resolve variations between genomes. The high degree of genome variation exhibited both on the population level as well as between genetically “identical” cells (even in the same organ) makes genetic and epigenetic analysis on the single cell and single genome level a necessity.
Chromosomes may be conceptually represented as a linear one-dimensional barcode. However, in contrast to a traditional binary barcode approach that considers only two possible bits of information (1 & 0), I will use colour and molecular structure to expand the variety of information represented in the barcode. Like colourful beads threaded on a string, where each bead represents a distinct type of observable, I will label each type of genomic information with a different chemical moiety thus expanding the repertoire of information that can be simultaneously measured. A major effort in this proposal is invested in the development of unique chemistries to enable this labelling.
I specifically address three types of genomic variation: Variations in genomic layout (including DNA repeats, structural and copy number variations), variations in the patterns of chemical DNA modifications (such as methylation of cytosine bases) and variations in the chromatin composition (including nucleosome and transcription factor distributions). I will use physical extension of long DNA molecules on surfaces and in nanofluidic channels to reveal this information visually in the form of a linear, fluorescent “barcode” that is read-out by advanced imaging techniques. Similarly, DNA molecules will be threaded through a nanopore where the sequential position of “bulky” molecular groups attached to the DNA may be inferred from temporal modulation of an ionic current measured across the pore.
Max ERC Funding
1 627 600 €
Duration
Start date: 2013-10-01, End date: 2018-09-30