MORE-Q, Dataset for molecular olfactorial receptor engineering by quantum mechanics
ChemRxiv (2024).
Li Chen, Leonardo Medrano Sandonas, Philipp Traber, Arezoo Dianat, Nina Tverdokhleb, Mattan Hurevich, Shlomo Yitzchaik, Rafael Gutierrez, Alexander Croy and Gianaurelio Cuniberti.
Journal DOI: https://doi.org/10.26434/chemrxiv-2024-zvt26

We introduce the MORE-Q dataset, a quantum-mechanical (QM) dataset encompassing the structural and electronic data of
non-covalent molecular sensors formed by combining 18 mucin-derived olfactorial receptors with 102 body odor volatilome
(BOV) molecules. To have a better understanding of their intra- and inter-molecular interactions, we have performed accurate
QM calculations in different stages of the sensor design and, accordingly, MORE-Q splits into three subsets: i) MORE-Q-G1:
QM data of 18 receptors and 102 BOV molecules, ii) MORE-Q-G2: QM data of 23,838 BOV-receptor configurations, and iii)
MORE-Q-G3: QM data of 1,836 BOV-receptor-graphene systems. Each subset involves geometries optimized using GFN2-xTB
with D4 dispersion correction and up to 39 physicochemical properties, including global and local properties as well as binding
features, all computed at the tightly converged PBE+D3 level of theory. By addressing BOV-receptor-graphene systems from
a QM perspective, MORE-Q can serve as a benchmark dataset for state-of-the-art machine learning methods developed to
predict binding features. This, in turn, can provide valuable insights for developing the next-generation mucin-derived olfactory
receptor sensing devices.

Cover
©https://doi.org/10.26434/chemrxiv-2024-zvt26
Share


Involved Scientists
MORE-Q, Dataset for molecular olfactorial receptor engineering by quantum mechanics
ChemRxiv (2024).
Li Chen, Leonardo Medrano Sandonas, Philipp Traber, Arezoo Dianat, Nina Tverdokhleb, Mattan Hurevich, Shlomo Yitzchaik, Rafael Gutierrez, Alexander Croy and Gianaurelio Cuniberti.
Journal DOI: https://doi.org/10.26434/chemrxiv-2024-zvt26

We introduce the MORE-Q dataset, a quantum-mechanical (QM) dataset encompassing the structural and electronic data of
non-covalent molecular sensors formed by combining 18 mucin-derived olfactorial receptors with 102 body odor volatilome
(BOV) molecules. To have a better understanding of their intra- and inter-molecular interactions, we have performed accurate
QM calculations in different stages of the sensor design and, accordingly, MORE-Q splits into three subsets: i) MORE-Q-G1:
QM data of 18 receptors and 102 BOV molecules, ii) MORE-Q-G2: QM data of 23,838 BOV-receptor configurations, and iii)
MORE-Q-G3: QM data of 1,836 BOV-receptor-graphene systems. Each subset involves geometries optimized using GFN2-xTB
with D4 dispersion correction and up to 39 physicochemical properties, including global and local properties as well as binding
features, all computed at the tightly converged PBE+D3 level of theory. By addressing BOV-receptor-graphene systems from
a QM perspective, MORE-Q can serve as a benchmark dataset for state-of-the-art machine learning methods developed to
predict binding features. This, in turn, can provide valuable insights for developing the next-generation mucin-derived olfactory
receptor sensing devices.

Cover
©https://doi.org/10.26434/chemrxiv-2024-zvt26
Share


Involved Scientists