ML4MSD Group Project: coGN & coNGN OOD Evaluation

Evaluating graph neural networks for 2D material property prediction under out-of-distribution conditions using MatFold splits.

Course: ML4MSD @ Northeastern University
Author: Ardavan Mehdizadeh
Date: December 2025

Background

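Standard benchmarks such as Matbench use random splits, which leak information between the train and test sets because similar compositions appear in both. MatFold provides systematic out-of-distribution (OOD) splits that hold out whole groups of materials, so the test error reflects generalization rather than interpolation.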
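The leakage argument can be illustrated on toy data. The sketch below is not MatFold's API: the formulas, the chemical-system labels, and the helper names are all made up for illustration. It compares a random split with a split that holds out one whole chemical system, and counts how many test samples share a chemical system with the training set.

```python
import random

# Toy data: (formula, chemical system). Hypothetical entries, not taken from matbench_jdft2d.
SAMPLES = [
    ("MoS2", "Mo-S"), ("Mo2S3", "Mo-S"), ("MoS", "Mo-S"),
    ("WSe2", "Se-W"), ("WSe", "Se-W"), ("W2Se3", "Se-W"),
    ("BN", "B-N"), ("B2N", "B-N"), ("BN2", "B-N"),
    ("GaSe", "Ga-Se"), ("Ga2Se3", "Ga-Se"), ("GaSe2", "Ga-Se"),
]


def random_split(samples, test_fraction, seed):
    """Shuffle the samples and use the last `test_fraction` of them as the test set."""
    rng = random.Random(seed)
    shuffled = samples[:]
    rng.shuffle(shuffled)
    n_test = max(1, round(len(shuffled) * test_fraction))
    return shuffled[:-n_test], shuffled[-n_test:]


def group_split(samples, held_out_groups):
    """Send every sample whose group is in `held_out_groups` to the test set."""
    train = [s for s in samples if s[1] not in held_out_groups]
    test = [s for s in samples if s[1] in held_out_groups]
    return train, test


def leaked(train, test):
    """Count test samples whose chemical system also occurs in the training set."""
    train_groups = {group for _, group in train}
    return sum(1 for _, group in test if group in train_groups)


rand_train, rand_test = random_split(SAMPLES, test_fraction=0.25, seed=0)
ood_train, ood_test = group_split(SAMPLES, held_out_groups={"B-N"})

print("random split, leaked test samples:", leaked(rand_train, rand_test))
print("group split, leaked test samples: ", leaked(ood_train, ood_test))
```

By construction the group split never leaks a chemical system, while a random split only avoids leakage if it happens to draw a complete group.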

Models

Model	Description
coGN	Crystal graph network — atoms as nodes, bonds as edges
coNGN	Adds line graph to capture bond angles

Dataset

matbench_jdft2d — 2D materials exfoliation energies (636 samples)
70/20/10 train/val/test split

Results

Split	coGN MAE (meV/atom)	coNGN MAE (meV/atom)	Winner
composition	27.2	30.6	coGN
chemsys	49.6	45.1	coNGN
sgnum	52.9	46.6	coNGN
periodictablegroups	52.4	53.7	coGN
pointgroup	59.8	51.6	coNGN
elements	60.7	91.0	coGN
Average	50.4	53.1	coGN
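The Average row and the Winner column follow directly from the per-split values. A minimal sketch that recomputes them, with the numbers copied from the table above; the variable and function names are illustrative and not taken from the project scripts.

```python
# MAE per MatFold split in meV/atom as (coGN, coNGN), copied from the results table.
MAE = {
    "composition": (27.2, 30.6),
    "chemsys": (49.6, 45.1),
    "sgnum": (52.9, 46.6),
    "periodictablegroups": (52.4, 53.7),
    "pointgroup": (59.8, 51.6),
    "elements": (60.7, 91.0),
}


def winner(cogn_mae, congn_mae):
    """Return the name of the model with the lower MAE."""
    return "coGN" if cogn_mae < congn_mae else "coNGN"


for split, (cogn_mae, congn_mae) in MAE.items():
    print(f"{split:<20} {cogn_mae:>6.1f} {congn_mae:>6.1f}  {winner(cogn_mae, congn_mae)}")

# Unweighted mean over the six splits.
avg_cogn = sum(cogn for cogn, _ in MAE.values()) / len(MAE)
avg_congn = sum(congn for _, congn in MAE.values()) / len(MAE)
print(f"{'Average':<20} {avg_cogn:>6.1f} {avg_congn:>6.1f}  {winner(avg_cogn, avg_congn)}")
```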

Key findings

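coGN is more robust overall: lower average MAE across the six splits (50.4 vs. 53.1 meV/atom)
coNGN wins on the structural splits and on chemsys: 9-14% lower MAE on pointgroup, sgnum, and chemsys
coNGN degrades sharply on unseen elements (91.0 vs. 60.7 meV/atom); a plausible cause is that the line graph overfits to element-specific bond patterns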
Pick your model based on use case: coNGN for new structures, coGN for new elements
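The percentages quoted above are relative MAE differences with coGN as the baseline. A minimal sketch of that arithmetic, using MAEs from the Results table; the function name is illustrative.

```python
def relative_change(cogn_mae, congn_mae):
    """Relative MAE change of coNGN with respect to coGN, in percent (negative means coNGN is better)."""
    return 100.0 * (congn_mae - cogn_mae) / cogn_mae


# (coGN, coNGN) MAE in meV/atom for the splits discussed in the key findings.
PAIRS = {
    "chemsys": (49.6, 45.1),
    "sgnum": (52.9, 46.6),
    "pointgroup": (59.8, 51.6),
    "elements": (60.7, 91.0),
}

for split, (cogn_mae, congn_mae) in PAIRS.items():
    print(f"{split:<11} {relative_change(cogn_mae, congn_mae):+.1f}%")
```

This gives roughly 9%, 12%, and 14% lower MAE for coNGN on chemsys, sgnum, and pointgroup, and about 50% higher MAE on elements.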

Structure

├── scripts/
│   ├── test_coGN_matfold.py      # main coGN script
│   ├── test_coNGN_matfold.py     # main coNGN script
│   └── run_coGN_all_splits.sh    # batch submission
├── results/
│   ├── coGN_matfold/             # JSONs, CSVs, saved models
│   └── coNGN_matfold/
├── environment.yml               # conda env spec
└── README.md

Saved Models

Each trained model is saved in TensorFlow SavedModel format:

coGN_chemsys_model/
├── fingerprint.pb              # model hash
├── keras_metadata.pb           # architecture metadata
├── saved_model.pb              # model graph
└── variables/
    ├── variables.data-00000-of-00001   # trained weights
    └── variables.index

Loading a model

import tensorflow as tf

model = tf.keras.models.load_model('results/coGN_matfold/coGN_chemsys_model')

# check inputs
print([inp.name for inp in model.inputs])
# ['offset', 'atomic_number', 'multiplicity', 'edge_indices']

Model sizes

Model	Size per split
coGN	~12 MB
coNGN	~40 MB

coNGN is larger due to the additional line graph layers for bond angle encoding.
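To check the on-disk size of a saved model, the files in its directory can be summed. A small sketch using only the standard library; `dir_size_mb` is a hypothetical helper, not part of the project scripts, and it reports megabytes as 10^6 bytes.

```python
import os


def dir_size_mb(path):
    """Total size of all files under `path` in MB (10**6 bytes); 0.0 if the path does not exist."""
    total_bytes = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            total_bytes += os.path.getsize(os.path.join(root, name))
    return total_bytes / 1e6


# Path taken from the loading example; adjust to the split you want to inspect.
print(f"{dir_size_mb('results/coGN_matfold/coGN_chemsys_model'):.1f} MB")
```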

Setup

conda env create -f environment.yml
conda activate cogn_env

Usage

Single split:

# edit SPLIT_TYPE in script, then:
python scripts/test_coGN_matfold.py

All splits (HPC):

bash scripts/run_coGN_all_splits.sh
squeue -u $USER  # monitor

Load saved model:

import tensorflow as tf
model = tf.keras.models.load_model('results/coGN_matfold/coGN_chemsys_model')

Split Types

Split	Tests
composition	same elements, different ratios
chemsys	entirely different element combos
elements	completely unseen elements
periodictablegroups	cross-group generalization
pointgroup	different symmetries
sgnum	different space groups
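The three chemistry-based criteria differ only in which label is used to group materials before holding groups out. The sketch below derives such labels from a plain formula string. It is a simplification for illustration, not MatFold's code: `parse_formula` and `split_labels` are hypothetical helpers, and the parser handles only simple formulas without parentheses.

```python
import re
from functools import reduce
from math import gcd


def parse_formula(formula):
    """Return {element: count} for a simple formula such as 'MoS2' (no parentheses)."""
    counts = {}
    for element, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[element] = counts.get(element, 0) + int(number or 1)
    return counts


def split_labels(formula):
    """Grouping labels for the composition, chemsys, and elements criteria."""
    counts = parse_formula(formula)
    divisor = reduce(gcd, counts.values())  # reduce e.g. Mo2S4 to Mo1S2
    elements = sorted(counts)
    return {
        "composition": "".join(f"{el}{counts[el] // divisor}" for el in elements),
        "chemsys": "-".join(elements),
        "elements": elements,
    }


for formula in ("MoS2", "Mo2S4", "MoS", "WSe2"):
    print(formula, split_labels(formula))
```

In this toy example MoS2 and Mo2S4 share a composition label while MoS does not, all three share the chemical system Mo-S, and holding out the element S would move every sulfide out of the training set.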

Training Config

50 epochs, batch size 32, Adam optimizer with learning rate 1e-3
k-nearest-neighbor (KNN) graph with k=12
StandardScaler on targets
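With scaled targets the network predicts standardized values, so predictions have to be mapped back before an MAE in meV/atom can be reported. The sketch below uses only the standard library and mirrors what scikit-learn's StandardScaler computes (mean and population standard deviation). The helper names, target values, and predictions are all hypothetical, and fitting on the training targets only is an assumption about good practice rather than a description of the project scripts.

```python
from statistics import fmean, pstdev


def fit_scaler(targets):
    """Mean and population standard deviation of the training targets."""
    return fmean(targets), pstdev(targets)


def transform(values, mean, std):
    """Map values in original units to standardized units."""
    return [(v - mean) / std for v in values]


def inverse_transform(values, mean, std):
    """Map standardized values back to the original units."""
    return [v * std + mean for v in values]


def mae(y_true, y_pred):
    """Mean absolute error between two equally long sequences."""
    return fmean(abs(t - p) for t, p in zip(y_true, y_pred))


train_targets = [40.0, 55.0, 70.0, 95.0, 120.0]  # meV/atom, made-up values
test_targets = [60.0, 110.0]                     # meV/atom, made-up values
scaled_predictions = [-0.4, 0.9]                 # made-up model outputs in standardized units

mean, std = fit_scaler(train_targets)
predictions = inverse_transform(scaled_predictions, mean, std)

mae_mev = mae(test_targets, predictions)
mae_scaled = mae(transform(test_targets, mean, std), scaled_predictions)
print(f"MAE in meV/atom: {mae_mev:.2f}")
print(f"MAE in scaled units times std: {mae_scaled * std:.2f}")
```

Both printed values agree, because the inverse transform rescales every absolute error by the same standard deviation.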

Acknowledgments

Prof. Peter Schindler (course instructor, MatFold developer)
