Data explorer

Explore the molecular dataset used in this application

Dataset overview

643

Total molecules

View molecules

4

Clusters

View details

4

Properties

View details

6

Descriptors

View details
Dataset description

This dataset contains 643 molecules clustered into 4 groups based on their electronic and structural properties. The clustering was performed using K-means algorithm with 4 clusters after dimensionality reduction using Principal Component Analysis (PCA).

Included properties
  • \(E_{\textrm{S}}-E_{\textrm{A}}\) (eV)
  • LUMO energy (eV)
  • HOMO energy (eV)
  • Dielectric constant of solvents
  • Cluster assignment (1-4)
Data source

The molecular data was obtained from Density Functional Theory(DFT), Molecular Dynamics(MD) calculation and deep-learning model prediction. The dielectric constants represent the solvent environment used in the calculations.

Cluster distribution
Cluster Number of molecules Percentage
Cluster 1 270 42.0%
Cluster 2 48 7.4%
Cluster 3 39 6.1%
Cluster 4 286 44.5%
Property statistics
Property Min Max Mean Std Dev
\(E_{\textrm{S}}-E_{\textrm{A}}\) (eV) -0.07 2.06 0.92 0.22
LUMO energy (eV) -0.75 8.61 -4.26 4.03
HOMO energy (eV) -9.58 -7.35 -7.69 0.25
Dielectric constant 1.01 58.85 5.22 6.09
Molecular descriptors
Show all
Cluster 1
Cluster 2
Cluster 3
Cluster 4
Steric hindrance

Molecular size and shape

F count

Number of fluorine atoms

Dielectric constant

Solvent polarity

\(E_{\textrm{S}}-E_{\textrm{A}}\)

Relative binding ability

LUMO

Reduction stability

HOMO

Oxidation stability