At this point, q-learn.py
contains the code for approximating the value function with a parameterized linear function. Unfortunately, this function fails to converge appropriately and was quickly abandoned in favor of neural networks.
This release may not be stable but is mostly just so I can go back to this code since it has now been overwritten.