This repo contains a dataset of 388 5-member heterocycles' reactivity entries in C-H functionalization and the code to train random forest regression models. Specifically, the data were collected in an iterative process with 8 rounds of active learning, using predictive models trained on each round of compound selection followed by experimentation to guide us in selection of the next round of compounds. Compared to collecting all data at once, this targeted sampling approach seeks to reduce the number of experiments and calculations to obtain a sufficiently robust model. This model continues to be used and updated as new use cases arise to test its performance and expand its domain of applicability.
-
Notifications
You must be signed in to change notification settings - Fork 0
License
MSDLLCpapers/HeterocycleReactivityML
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
No description, website, or topics provided.
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published