Simplify init #82

mcocdawc · 2025-01-09T22:26:33Z

- moved the saving operation out of the `__init__` method.
- don't redundantly store information from `fobj`, but use `fobj` itself directly.
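A minimal sketch of the first point, assuming a hypothetical `Calculation` class (it is not quemb's actual `BE` class, and the JSON format is only for illustration): the constructor just populates attributes, and writing to disk becomes an explicit `save` call.

```python
import json
import tempfile
from pathlib import Path


class Calculation:
    """Hypothetical stand-in for a class whose __init__ used to write to disk."""

    def __init__(self, name: str, values: list[float]) -> None:
        # Only populate attributes; constructing the object has no side effects.
        self.name = name
        self.values = values

    def save(self, directory: Path) -> Path:
        # Persisting is a separate step that the caller opts into explicitly.
        path = directory / f"{self.name}.json"
        path.write_text(json.dumps({"name": self.name, "values": self.values}))
        return path


with tempfile.TemporaryDirectory() as tmp:
    calc = Calculation("demo", [1.0, 2.0])
    files_before_save = list(Path(tmp).iterdir())  # nothing has been written yet
    saved = calc.save(Path(tmp))
    restored = json.loads(saved.read_text())
```

With this split, creating the object in a test or an interactive session does not touch the file system unless `save` is called.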

mscho527 · 2025-01-10T22:00:18Z

I'll have to think a little bit more, but I am a bit wary of directly using self.fobj. If we are concerned about the redundant information we are retaining here, is there a reason why we wouldn't take the information from the fobj in the init and not retain self.fobj instead? I'll think about this a bit longer and update lol; just my very first thoughts :)

mcocdawc · 2025-01-11T20:55:21Z

> I'll have to think a little bit more, but I am a bit wary of directly using self.fobj. If we are concerned about the redundant information we are retaining here, is there a reason why we wouldn't take the information from the fobj in the init and not retain self.fobj instead? I'll think about this a bit longer and update lol; just my very first thoughts :)

The reason the fragpart class (the type of self.fobj) exists is that its components, such as Nfrag or fsites, belong together, so it makes sense to collect them in one class. If we used only one or two components of self.fobj, I could live with extracting them from self.fobj. But the way the code is currently written, we extract all components of self.fobj and unnecessarily break encapsulation.

This is:

1. Unnecessarily verbose.
2. Hard to maintain.
   - 2.1. What do you do if you add a component to fragpart? Do you always have to remember to add it also to BE?¹
   - 2.2. What do you do if you change the representation of fragments altogether? At the moment you have to change unnecessarily much in BE.
3. Redundant. Do you use self.fobj.Nfrag or self.Nfrag? If we have self.Nfrag, then we should not also store self.fobj, or the other way round.

¹ That's not hypothetical. I need to store some more lookup information per fragment for the faster integral transformation.

Just to give a small example where I think it becomes obvious:

Let's say you had a 2D Rectangle:

```python
class Rectangle:
    def __init__(self, a: float, b: float) -> None:
        self.a = a
        self.b = b

    def area(self) -> float:
        return self.a * self.b
```

then you can easily implement a Prism on top

```python
class Prism:
    def __init__(self, base: Rectangle, h: float) -> None:
        self.base = base
        self.h = h

    def volume(self) -> float:
        # Only the public interface of the base is used.
        return self.base.area() * self.h
```

It is easy to see how this code works even with general 2D shapes without any changes (apart from the type hinting)
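As a sketch of what "apart from the type hinting" could look like: the hint on `base` can be widened to a structural type. The names `Shape2D` and `Circle` are made up for this illustration, and `typing.Protocol` is just one way to express it.

```python
import math
from typing import Protocol


class Shape2D(Protocol):
    """Anything that can report an area (hypothetical name for this sketch)."""

    def area(self) -> float: ...


class Rectangle:
    def __init__(self, a: float, b: float) -> None:
        self.a = a
        self.b = b

    def area(self) -> float:
        return self.a * self.b


class Circle:
    def __init__(self, r: float) -> None:
        self.r = r

    def area(self) -> float:
        return math.pi * self.r**2


class Prism:
    # Same body as before; only the type hint of `base` is widened.
    def __init__(self, base: Shape2D, h: float) -> None:
        self.base = base
        self.h = h

    def volume(self) -> float:
        return self.base.area() * self.h


box = Prism(Rectangle(2.0, 3.0), 4.0)
cylinder = Prism(Circle(1.0), 2.0)
```

Neither `Rectangle` nor `Circle` has to inherit from `Shape2D`; having an `area()` method is enough for a type checker to accept them.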

But you can also unnecessarily couple the Prism to the rectangle:

```python
class CoupledPrism:
    def __init__(self, rectangle: Rectangle, h: float) -> None:
        self.base = rectangle
        # Copies of the rectangle's internals, stored next to the rectangle itself.
        self.a = rectangle.a
        self.b = rectangle.b
        self.h = h

    def volume(self) -> float:
        return self.a * self.b * self.h
```

now you cannot just pass in another 2D shape with an area function, and the code looks more complicated.
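To make the failure concrete, here is a small sketch with a hypothetical `Triangle` shape. It has an `area()` method but no `a` or `b` attributes, so the coupled constructor breaks as soon as it reaches for the rectangle's internals.

```python
class Rectangle:
    def __init__(self, a: float, b: float) -> None:
        self.a = a
        self.b = b

    def area(self) -> float:
        return self.a * self.b


class Triangle:
    """Hypothetical second shape: has an area, but no sides called a and b."""

    def __init__(self, base: float, height: float) -> None:
        self.base = base
        self.height = height

    def area(self) -> float:
        return 0.5 * self.base * self.height


class CoupledPrism:
    def __init__(self, rectangle: Rectangle, h: float) -> None:
        self.base = rectangle
        self.a = rectangle.a  # relies on Rectangle's internals
        self.b = rectangle.b
        self.h = h

    def volume(self) -> float:
        return self.a * self.b * self.h


# Works only for the one shape it was written against.
rect_volume = CoupledPrism(Rectangle(2.0, 3.0), 4.0).volume()

try:
    CoupledPrism(Triangle(3.0, 4.0), 2.0)
except AttributeError as err:
    failure = str(err)  # Triangle has no attribute 'a'
else:
    failure = None
```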

Last but not least, writing the uncoupled version is super easy with attrs.define, while the coupled version is a bit more complicated. IMHO it is a reliable rule of thumb that if a class is hard to write with attrs.define, then it is probably the worse class design.

attrs example

```python
from attrs import define, field


@define
class AttrsPrism:
    base: Rectangle
    h: float

    def volume(self) -> float:
        return self.base.area() * self.h


@define
class CoupledAttrsPrism:
    base: Rectangle
    h: float
    # Not constructor arguments; filled in after the generated __init__ has run.
    a: float = field(init=False)
    b: float = field(init=False)

    def __attrs_post_init__(self) -> None:
        self.a = self.base.a
        self.b = self.base.b

    def volume(self) -> float:
        return self.a * self.b * self.h
```
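The redundancy point can also be shown directly. The sketch below uses plain classes instead of attrs so that it runs without extra dependencies; the behaviour should be the same for the attrs versions, since attrs classes are mutable by default. If the shared base changes after construction, the copied `a` and `b` go stale.

```python
class Rectangle:
    def __init__(self, a: float, b: float) -> None:
        self.a = a
        self.b = b

    def area(self) -> float:
        return self.a * self.b


class Prism:
    def __init__(self, base: Rectangle, h: float) -> None:
        self.base = base
        self.h = h

    def volume(self) -> float:
        return self.base.area() * self.h


class CoupledPrism:
    def __init__(self, rectangle: Rectangle, h: float) -> None:
        self.base = rectangle
        self.a = rectangle.a  # redundant copy
        self.b = rectangle.b  # redundant copy
        self.h = h

    def volume(self) -> float:
        return self.a * self.b * self.h


rect = Rectangle(2.0, 3.0)
prism = Prism(rect, 4.0)
coupled = CoupledPrism(rect, 4.0)

rect.a = 10.0  # resize the shared base after both prisms were built

uncoupled_volume = prism.volume()    # follows the base: 10 * 3 * 4
coupled_volume = coupled.volume()    # still uses the stale copy: 2 * 3 * 4
coupled_base_area = coupled.base.area()  # disagrees with coupled.a * coupled.b
```

After the resize, `coupled.base` and `coupled.a` describe two different rectangles, which is the `self.fobj.Nfrag` versus `self.Nfrag` question in miniature.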

mscho527

cool cool nice

@lweisburn

* Automatic generation of API documentation (#70)

  - Create API reference automatically for all public members of quemb. Before this change one had to manually create rst files and manage them. Now they are recursively created automatically. The layout of the documentation matches the layout of the namespace.
  - This change automatically ensures that all (public) docstrings are actually parsed by Sphinx and it is ensured that they are properly written. Had to fix some docstrings.
  - Type-hinting is now picked up by sphinx.

* H5py use contextmanager (#73)

  * put if __main__ guard to test
  * use h5py contextmanager nearly everywhere
  * fixed a bug from wrong close statement

* Fix parallelization and add parallel tests (#75)

  * fix _opt to work with be_parallel
  * modify oneshot to work with be_func_parallel
  * modify two tests to use nproc > 1
  * fix ruff errors
  * modify octane test
  * Remove CCSD reference from octane test

* Final scratch dir attempt (#74)

  # Major changes

  This PR unifies the use of the ScratchDir. It is now guaranteed that files are cleaned up if the calculation exits with no error. (If the user passes `WorkDir(cleanup_at_end=False)` the files are retained.)

  A `WorkDir` is passed from the top of the call stack to the bottom. This allows the user to change the WorkDir if necessary. (Fixes #19)

  Got rid of some special cased keyword arguments for DMRG and moved them into an explicit `DMRG_solver_kwargs`. This way we can catch arguments that are supplied and do nothing. It is probably a good idea to do the same for SHCI specific keywords @lweisburn. It is probably a good idea to get rid of the dictionary altogether and to replace it with a dataclass.

  # Minor changes

  If the `__init__` of a class was just populating the attributes, then I replaced the class definition with `attrs.define`.

  On the way I did a lot of type hinting. It is probably a good idea to double check my assumptions in review.

  Additionally, I did some refactoring, where explicit open-close pairs were replaced with context managers. The same was true for multiprocessing Pools.

  Here and there I encountered constructs that could be simplified a lot

  ```python3
  veffs = []
  [veffs.append(result.get()) for result in results]
  # becomes
  veffs = [result.get() for result in results]
  ```

  or

  ```python3
  self.frozen_core = False if not fobj.frozen_core else True
  # becomes
  self.frozen_core = fobj.frozen_core
  ```

  or

  ```python3
  rdm_return = False
  if relax_density:
      rdm_return = True
  # becomes
  rdm_return = relax_density
  ```

* Update fragment energies (#79)

  * Untangling eeval and frag_energy energy evaluation options
  * small changes for frag_energy consistency
  * start changing eeval and ereturn in parallel functions, making serial and parallel consistent
  * put if __main__ guard to test
  * use h5py contextmanager nearly everywhere
  * changed mbe.StoreBE into attrs defined class
  * ignore broken link to numpy.float64 in documentation
  * fixed a bug from wrong close statement
  * testsuite don't wait for analysis
  * added types for molbe BE object
  * fixed wrong close statements
  * fixed scratch-dir for molbe-be
  * added typing for molbe.BE.optimize
    - this fixed a likely bug in optimize. previously we had J0 = [[J0[-1, -1]]] now it is J0 = [[J0[-1][-1]]] since it is a list[list[float]]
  * renamed _opt.py to opt.py

    finished BEOPT attrs refactoring
  * pass on solver_kwargs to be_func
  * added types to be_func
  * added delete multiple_files function
  * use the new scratch dir in be_func
  * added types to molbe.BE.optimize + call self.scratch_dir.cleanup

    call self.scratch_dir.cleanup in both `optimize` and `oneshot`
  * moved schmidt_decomposition to prevent circular import

    makes much more sense like this
  * added type hints to be_func_parallel
  * fixed typo
  * fixed several small errors in the code
  * fixed be_func_parallel calls
  * simplified be_func_parallel
  * added types to run_solver
  * use frag_scratch in be_func_parallel
  * use frag_scratch in run_solver
  * added typehints to kbe BE class
  * simplified a few boolean expressions
  * the tests should be working now
  * ensure scratch cleanup for kbe
  * removed call to self.compute_energy_full (related to #35)
  * write nicer Pool statements
    - used proper list comprehensions
    - use contextmanager for Pool
  * fixed naming error
  * use more explicit way of freeing memory (it is still ugly... )
  * refactored expression
  * use Tuple[int, ...] for numpy shapes :-(
  * refactored WorkDir to use atexit.register for cleanup
  * added better docstring for config
  * require static analysis to be ok for running test suite
  * renamed DMRG specific solver kwargs to its proper name
  * better naming for DMRG stuff
  * added types to block2 DMRG function
  * refactor some DMRG arguments
  * change behaviour of scratch contextmanager

    now it is ensured that files are deleted even after an exception when using it as context manager and cleanup_at_end=True
  * fixed the deadlock in the test requirements
  * added new scratch dir also to ube
  * avoid list[list[float]]; use consistently array instead
  * Update energy keywords and logic in restricted BE, serial and parallel, oneshot and parallel. Update and rearrange some documentation for the keywords

    TODO: add non-cumulant energy, update unrestricted BE
  * Update kbe/pbe and kbe/misc for consistency
  * Fix kbe/pbe and kbe/misc to raise error for non-cumulant
  * Add non-cumulant energy option for molecular code
  * remove mypy attempt from get_energy_frag for now
  * fix be2puffin call of oneshot and update ube oneshot default
  * Update get_frag_energy function to work for periodic calculations
  * Remove double del line
  * remove redundant frag_energy keyword
  * Update src/quemb/molbe/helper.py, eri_file to be optional

    Co-authored-by: Minsik <[email protected]>
  * Update src/quemb/molbe/mbe.py: ebe_tot readability

    Co-authored-by: Minsik <[email protected]>
  * move use_cumulant to optimize and oneshot, not BE

  ---------

  Co-authored-by: Leah Weisburn <[email protected]>
  Co-authored-by: Oskar Weser <[email protected]>
  Co-authored-by: Minsik <[email protected]>

* Adding back veff0 for molecular code, removing the now-unnecessary hf_veff from be_func (#81)

* More type annotations + simpler boolean expressions (#76)

  Simplifications in the code
  - A couple more type annotations
  - simpler boolean expressions
  - removed keyword arguments if they are always true in the code

* Dataclass for solver kwargs (#77)

  - Introduced a dataclass for solver kwargs. This makes it much more explicit what arguments are supported by DMRG and SHCI
  - Could simplify the interface of BE by removing a couple of arguments

* Simplify init (#82)

  - moved the saving operation out of the __init__ method.
  - don't redundantly store information from `fobj`, but use `fobj` itself directly.

* Make numpy shorter (#83)

  - replacing `np.dot` with `@`
  - replacing the `numpy.something` calls with `from numpy import something` if functions appear several times
  - replacing the `numpy.something` call with `np.something` if a function appears rarely

---------

Co-authored-by: Oskar Weser <[email protected]>
Co-authored-by: Leah Weisburn <[email protected]>
Co-authored-by: Leah Weisburn <[email protected]>
Co-authored-by: Minsik <[email protected]>
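The `J0` fix mentioned in the commit message above can be reproduced in plain Python. The values below are made up; only the nesting (`list[list[float]]`) matters. Indexing with a tuple works for a numpy array but raises a `TypeError` for a nested list, which is presumably why the old expression was a likely bug.

```python
# Hypothetical values; J0 is a nested list, not a numpy array.
J0 = [[1.0, 2.0], [3.0, 4.0]]

try:
    J0[-1, -1]  # tuple index: valid for a numpy array, not for a list
except TypeError as err:
    message = str(err)
else:
    message = None

# Index the last row first, then its last element.
last = J0[-1][-1]
J0 = [[last]]
```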

mcocdawc added 30 commits December 19, 2024 15:39

put if __main__ guard to test

bf4605f

use h5py contextmanager nearly everywhere

eb2969d

changed mbe.StoreBE into attrs defined class

fa6cc27

ignore broken link to numpy.float64 in documentation

be6176b

fixed a bug from wrong close statement

86fa44b

Merge branch 'h5py_use_contextmanager' into final_scratch_dir_attempt

97925ff

testsuite don't wait for analysis

3942320

added types for molbe BE object

499aa5b

fixed wrong close statements

3492c66

fixed scratch-dir for molbe-be

35adb0c

added typing for molbe.BE.optimize

b0d8817

- this fixed a likely bug in optimize. previously we had J0 = [[J0[-1, -1]]] now it is J0 = [[J0[-1][-1]]] since it is a list[list[float]]

renamed _opt.py to opt.py

1bb1105

finished BEOPT attrs refactoring

pass on solver_kwargs to be_func

b46479b

added types to be_func

bcabc40

added delete multiple_files function

0517d47

use the new scratch dir in be_func

a458330

added types to molbe.BE.optimize + call self.scratch_dir.cleanup

5c384c0

call self.scratch_dir.cleanup in both `optimize` and `oneshot`

moved schmidt_decomposition to prevent circular import

f4cb9fd

makes much more sense like this

added type hints to be_func_parallel

83e6773

Merge branch 'main' of github.com:troyvvgroup/quemb into final_scratc…

2cdf8ff

…h_dir_attempt

fixed typo

e24a5e3

fixed several small errors in the code

b5ea348

fixed be_func_parallel calls

2fce4b9

simplified be_func_parallel

f5255c9

added types to run_solver

17f443f

use frag_scratch in be_func_parallel

f6f93c5

Merge branch 'main' of github.com:troyvvgroup/quemb into final_scratc…

14caf5c

…h_dir_attempt

use frag_scratch in run_solver

6904c62

added typehints to kbe BE class

fe7bf55

simplified a few boolean expressions

9086c9c

mcocdawc added 18 commits December 23, 2024 16:06

incremented the python version for type checking

02e7ac2

Merge branch 'more_type_annotations' into dataclass_for_solver_kwargs

5118a0c

try floating instead of float64

128fffb

don't overspecify the types

cbc1d7d

Merge branch 'more_type_annotations' into dataclass_for_solver_kwargs

0204600

added correct link to doc

6e7dc84

Merge branch 'more_type_annotations' into dataclass_for_solver_kwargs

570a9d2

simplified DMRG args

e88d6e4

use a factory for mutable default attributes

271b1b0

removed unused setting and changed default for SCRATCH to /tmp

4198d9b

added types to kbe.BE.optimize

214b091

and fixed corresponding tests

Merge branch 'main' of github.com:troyvvgroup/quemb into dataclass_fo…

d0f4c26

…r_solver_kwargs

fixed small error

a29ec7a

use tempfile.gettempdir()

6d9bdb7

fixed J0[-1, -1] also for kbe

5601fa4

separated save method

4681ab0

don't duplicate fobj information

b3a10f4

introduced save method also for kbe

545d5d3

mcocdawc mentioned this pull request Jan 10, 2025

Make numpy shorter #83

Merged

mcocdawc added 2 commits January 10, 2025 16:41

Merge branch 'main' of github.com:troyvvgroup/quemb into simplify_init

dd402ec

don't store fobj information redundantly in kbe

347b7bc

mcocdawc requested a review from mscho527 January 10, 2025 21:50

mcocdawc marked this pull request as ready for review January 10, 2025 21:50

made changes also for UBE

6b45f01

mscho527 approved these changes Jan 13, 2025

mcocdawc merged commit 7ca8d14 into main Jan 13, 2025
4 checks passed

mcocdawc deleted the simplify_init branch January 13, 2025 15:15
