Reaction Outcome Prediction Task Overview

Definition: Reaction outcome prediction is to predict the reaction products given a set of reactants. Reaction outcome prediction can be seen as reverse process of retrosynthesis prediction.

Impact: Predicting the products as a result of a chemical reaction is a fundamental problem in organic chemistry. It is quite challenging for many complex organic reactions. Conventional empirical methods that relies on experimentation requires intensive manual label of an experienced chemist, and are always time-consuming and expensive. Reaction Outcome Prediction aims at automating the process.

Generalization: The model is expected to accurately generate product for novel set of reactants across reaction types with varying reaction conditions.

Product: Small-molecule.

Pipeline: Manufacturing - Synthesis planning.

USPTO

Dataset Description: The full USPTO (United States Patent and Trademark Office) retrosynthesis dataset.

Task Description: Given reactant set X, predict the reaction product Y.

Dataset Statistics: 1,939,253 reactions.

Dataset Split: Random Split

from tdc.generation import Reaction
data = Reaction(name = 'USPTO')
split = data.get_split()

References:

[1] Daniel Lowe. Chemical reactions from US patents (1976-Sep2016).

[2] Jin, Wengong, et al. “Predicting organic reaction outcomes with weisfeiler-lehman network.” NeurIPS. 2017.

Dataset License: CC0.