Graph theoretic fragmentation via `graphgen`. #86

ShaunWeatherly · 2025-01-14T16:52:31Z

The idea is rather straightforward:

graphgen generalizes the BEn fragmentation scheme to arbitrary fragment sizes (I've tested BE1-BE9 so far) using a graph-theoretic heuristic. Atoms are assigned to nodes in an adjacency graph, and edges are weighted by some distance metric. For a given fragment center site, Dijkstra's algorithm finds the shortest path from that center to each of the other atoms. The number of nodes visited on that shortest path determines the degree of separation of the corresponding atom. That is, all atoms whose shortest paths from the center site visit at most 1 node must be direct neighbors (adjacent to the center site), which gives BE2-type fragments; all atoms whose shortest paths visit at most 2 nodes are at most second-order neighbors, hence BE3; and so on. The implementation depends on the NetworkX library.
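
A minimal sketch of the heuristic described above, not the code in this PR. It uses only the standard library (`heapq`) rather than NetworkX, and the function names, the distance cutoff, and the squared-distance edge weight are all assumptions made for illustration.

```python
import heapq
from itertools import combinations


def build_adjacency(coords, cutoff):
    """Connect every pair of atoms within `cutoff` (assumed parameter).

    The edge weight is the squared distance (an assumed metric), so a path
    made of several short hops is cheaper than one long direct edge.
    """
    adj = {i: {} for i in range(len(coords))}
    for i, j in combinations(range(len(coords)), 2):
        d2 = sum((a - b) ** 2 for a, b in zip(coords[i], coords[j]))
        if d2 <= cutoff**2:
            adj[i][j] = d2
            adj[j][i] = d2
    return adj


def hops_on_shortest_paths(adj, center):
    """Dijkstra from `center`; return the hop count of each shortest path."""
    dist = {center: 0.0}
    hops = {center: 0}
    heap = [(0.0, center)]
    done = set()
    while heap:
        d, u = heapq.heappop(heap)
        if u in done:
            continue  # stale heap entry
        done.add(u)
        for v, w in adj[u].items():
            nd = d + w
            if v not in dist or nd < dist[v]:
                dist[v] = nd
                hops[v] = hops[u] + 1
                heapq.heappush(heap, (nd, v))
    return hops


def fragment(adj, center, n):
    """BEn-type fragment: atoms at most n - 1 hops away from `center`."""
    hops = hops_on_shortest_paths(adj, center)
    return sorted(atom for atom, h in hops.items() if h <= n - 1)


# Toy system: six atoms on a line, 1.0 apart (made-up geometry).
chain = [(float(i), 0.0, 0.0) for i in range(6)]
adj = build_adjacency(chain, cutoff=2.5)
print(fragment(adj, center=2, n=1))  # [2]
print(fragment(adj, center=2, n=2))  # [1, 2, 3]
print(fragment(adj, center=2, n=3))  # [0, 1, 2, 3, 4]
```

Even though the cutoff also links second neighbors directly, the squared-distance weight makes the two-hop route cheaper, so the hop count still reflects the degree of separation.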

Major points to note:

Works for arbitrary n in BEn (yields nearly identical fragments to autogen for n = 1 to 3).
Does not rely on heuristic bond lengths or atom identities.
Much simpler code, which I think we can all appreciate.
Most importantly, the distance metric for assigning edge weights in the adjacency graph can easily be changed. For example, I'm currently looking into fragmentation via entanglement (orbital-pair mutual information).
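
To illustrate the last point, here is a hedged sketch of what a swappable edge-weight metric could look like. `weighted_edges`, the `metric` callable, the cutoff, and the lookup table are hypothetical names and made-up numbers, not the interface of graphgen, and the table is only a stand-in for some precomputed pairwise quantity.

```python
import math


def weighted_edges(n_sites, metric, cutoff):
    """Return {(i, j): weight} for all pairs whose weight is within `cutoff`.

    `metric(i, j)` is any callable returning a distance-like number, so the
    graph construction does not need to know how the weight is defined.
    """
    edges = {}
    for i in range(n_sites):
        for j in range(i + 1, n_sites):
            w = metric(i, j)
            if w <= cutoff:
                edges[(i, j)] = w
    return edges


# Metric 1: plain Euclidean distance between made-up coordinates.
coords = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (3.0, 0.0, 0.0)]


def euclidean(i, j):
    return math.dist(coords[i], coords[j])


# Metric 2: lookup in a precomputed pairwise table (made-up values).
pair_table = {(0, 1): 0.9, (0, 2): 0.2, (1, 2): 0.1}


def table_metric(i, j):
    return pair_table[(min(i, j), max(i, j))]


geo_edges = weighted_edges(3, euclidean, cutoff=1.5)
tab_edges = weighted_edges(3, table_metric, cutoff=0.5)
print(geo_edges)  # {(0, 1): 1.0}
print(tab_edges)  # {(0, 2): 0.2, (1, 2): 0.1}
```

The two metrics produce different graphs from the same sites, which is the property that makes alternative fragmentation criteria possible without touching the shortest-path step.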

EXTRA: now adds a FragmentMap data class.
EXTRA2: there are 14 new unit tests covering both autogen and graphgen.

mcocdawc

Great improvement! Thank you!

I have a few strongly requested changes and some other comments.

src/quemb/molbe/autofrag.py

src/quemb/molbe/fragment.py

mcocdawc · 2025-01-15T01:49:16Z

Oh, and the most important comment: I really would add a test, probably something like molbe_octane_test.py but using this fragmentation. This could ensure that autogen and graphgen indeed produce the same results for BE1-3.
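
One way such a comparison could be written is sketched below. The helper names and the fragment lists are hypothetical, not taken from the test suite; the point is only that fragments from two schemes can be compared without depending on the order of atoms or of fragments.

```python
def canonical(fragments):
    """Order-insensitive form: a set of frozensets of atom indices.

    Note that this also collapses repeated fragments into one.
    """
    return {frozenset(frag) for frag in fragments}


def same_fragments(a, b):
    """True if both fragment lists contain the same fragments."""
    return canonical(a) == canonical(b)


# Made-up fragment lists standing in for the output of two schemes.
reference = [[0, 1, 2], [1, 2, 3]]
candidate = [[3, 2, 1], [2, 0, 1]]

print(same_fragments(reference, candidate))    # True
print(same_fragments(reference, [[0, 1, 2]]))  # False
```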

ShaunWeatherly · 2025-01-15T20:58:15Z

Alright, unit tests have been added for both autogen and graphgen. I think I've also addressed all of your other comments, but do let me know if there's anything I've missed!

mcocdawc

Looks really good now.
The only big question mark I still have is about the euclidean_norm function.

A general comment, just for the record and as a potential improvement for the future; we talked about it in person.
I think there is unnecessary casting between lists, tuples, and sets.
This graph operation can most likely be done entirely with set and frozenset, with only one cast to an ordered container such as list at the end.
The current implementation needs more bookkeeping to avoid duplicates, and it casts more often than necessary.
But the code works, is really well tested, and is readable. If you want to keep it like this, I am fine with that too.
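
A small sketch of the set-based approach suggested here, under assumed names (`unique_fragments`, `neighbors`, `centers` are illustrative and not part of the codebase). Fragments are built as frozenset objects, duplicates disappear by set membership, and the only cast to an ordered container happens at the end.

```python
def unique_fragments(neighbors, centers):
    """Build one fragment per center and drop exact duplicates.

    `neighbors` maps each site to the set of sites attached to it.
    """
    fragments = set()
    for center in centers:
        # frozenset is hashable, so identical fragments collapse automatically
        fragments.add(frozenset({center} | neighbors[center]))
    # single cast to ordered containers, only for the final output
    return sorted(sorted(frag) for frag in fragments)


# Made-up connectivity: two disconnected pairs of sites.
neighbors = {0: {1}, 1: {0}, 2: {3}, 3: {2}}
print(unique_fragments(neighbors, centers=[0, 1, 2, 3]))  # [[0, 1], [2, 3]]
```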

.ruff.toml

src/quemb/molbe/autofrag.py

src/quemb/molbe/fragment.py

src/quemb/molbe/lo.py

src/quemb/molbe/solver.py

tests/fragmentation_test.py

src/quemb/molbe/autofrag.py

tests/fragmentation_test.py

ShaunWeatherly added 9 commits January 14, 2025 11:28

Added graphgen fragmentation.

4a3a4b3

Fix reference to mf.mol when lo_method="pipek"

cacf442

Fix reference to frag_scratch when solver=="DMRG"

61907e2

Ruff fixes.

17f1b84

Formatting and removed unused code.

a37a0e2

Final formatting

1f17bcd

Update dependencies.

865b9c2

mypy static typing.

364ae2f

Suppress mypy for networkx imports.

f0fd882

ShaunWeatherly requested a review from mcocdawc January 14, 2025 18:06

ShaunWeatherly added 2 commits January 14, 2025 13:09

Remove debug prints

cd77b7d

Added complete docstring for graphgen.

cff391d

mcocdawc requested changes Jan 14, 2025

ShaunWeatherly added 16 commits January 15, 2025 10:29

Add types package for networkx

ec70e05

Unsuppress mypy warnings

49c20d5

norm

24adc16

New FragmentMap data class.

5fec9ed

Edits to FragmentMap typing.

7a4e53d

Remove unused kwargs.

f94c164

Fix formatting

79dce30

Final formatting

a4333a6

Additions to nitpick_exceptions.

083e275

FragmentMap Docstring edits.

bfa1273

Remove unfinished code.

31186f7

Use np.floating[Any]

98b4f72

Organize imports.

7d4e7b5

Fixes for FragmentMap

815f0fe

Add unit tests for autogen and graphgen

e9d5db9

Long line fix.

79a0627

ShaunWeatherly added 4 commits January 15, 2025 15:33

Suppress E501 in fragmentation_test

546ee00

Add ruff exclusion rule for fragmentation_tests

d21a66a

Ruff formatting yet again.

c5b60d9

Add fragmentation_test to mypy blacklist

5ebb8d4

ShaunWeatherly requested a review from mcocdawc January 15, 2025 20:58

ShaunWeatherly added 13 commits January 16, 2025 15:23

Remove defaults in FragmentMap init.

432efcc

Add unit tests for energy comparisons across autogen and graphgen

5a6ec5b

Add checks for IAOs in graphgen

0fddbff

Update graphgen docstring

d9fee42

Formatting.

1eab146

Test intersphinxlink

b8f1eaa

More strict typing for adjacency_mat

fcf5c23

Test removing fragment_map from nitpick exceptions.

e07aa2d

Rename valence_basis to iao_valence_basis

5b2108b

Finish renaming valence_basis to iao_valence_basis

bbbbe9b

Ruff fixes.

5cc31d7

Final formatting.

32a1eff

Update molbe_ppp

af8a8cf

mcocdawc reviewed Jan 17, 2025

mcocdawc mentioned this pull request Jan 17, 2025

added chemical fragmentation #85

Draft

mcocdawc reviewed Jan 17, 2025

tests/fragmentation_test.py

ShaunWeatherly added 2 commits January 17, 2025 14:57

Address Oskar comments.

b4e2b5a

Simplified check for fragpart.mol

8253907

ShaunWeatherly requested a review from mcocdawc January 17, 2025 20:30

mcocdawc approved these changes Jan 17, 2025

ShaunWeatherly merged commit 8c410c4 into main Jan 17, 2025
4 checks passed

ShaunWeatherly deleted the new_graphgen branch January 17, 2025 21:20

mscho527 mentioned this pull request Jan 19, 2025

issues with graphgen #90

Open

4 tasks

mcocdawc mentioned this pull request Jan 20, 2025

Indexing fixes, optimization, and etc. for graphgen #91

Open
