monroeminutes.org

Tools and Technologies

The site is built on Python scrapers and a PHP web back end that interfaces with a MySQL database.

The base classes for both the Python and the PHP code were generated from SQL via sql2api (http://github.com/thequbit/sql2api).

Overview

MonroeMinutes, and thus monroeminutes.org, is a document aggregator and indexer. The tools scrape village, town, city, and county websites within Monroe County and pull meeting minutes and other important documents from them. These documents are then indexed using a histogram method with keyword rejection (a blacklist of words that are too common to include in search).
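
To illustrate the idea, here is a minimal sketch of histogram indexing with keyword rejection. It is not the project's actual indexer: the `build_histogram` function, the tokenizing regular expression, and the contents of `BLACKLIST` are all assumptions made for this example.

```python
import re
from collections import Counter

# Hypothetical blacklist; the project's real word list is not shown here.
BLACKLIST = {"the", "and", "of", "to", "a", "in", "was", "for"}


def build_histogram(text, blacklist=BLACKLIST):
    """Count how often each non-blacklisted word appears in text."""
    # Lowercase the text and split it into simple word tokens.
    words = re.findall(r"[a-z0-9']+", text.lower())
    # Keyword rejection: drop words that are too common to be useful.
    return Counter(w for w in words if w not in blacklist)


sample = "The board approved the budget. The budget was approved for 2014."
histogram = build_histogram(sample)
print(histogram)
```

A search for a word can then rank documents by how often that word appears in each document's histogram.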

The data is organized as follows:

|-> Organization 0
	|-> Sub Organization 0
		|-> Document 0
			|-> <token data>
		|-> Document 1
			|-> <token data>
		|-> Document 2
			|-> <token data>

	|-> Sub Organization 1
		|-> Document 0
			|-> <token data>

|-> Organization 1
	|-> Sub Organization 0
		|-> Document 0
			|-> <token data>
			|-> <token data>
		|-> Document 1
			|-> <token data>
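
The hierarchy above could be modeled in memory roughly as follows. This is only a sketch: the real data lives in MySQL behind classes generated by sql2api, and the class names, field names, and `search` helper below are hypothetical.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class Document:
    name: str
    # Token data: maps each indexed word to its count in the document.
    tokens: Counter = field(default_factory=Counter)


@dataclass
class SubOrganization:
    name: str
    documents: list = field(default_factory=list)


@dataclass
class Organization:
    name: str
    sub_organizations: list = field(default_factory=list)


def search(organizations, word):
    """Yield (organization, sub-organization, document, count) matches."""
    for org in organizations:
        for sub in org.sub_organizations:
            for doc in sub.documents:
                count = doc.tokens[word]  # a Counter returns 0 if absent
                if count > 0:
                    yield org.name, sub.name, doc.name, count


# Example data; all names are made up for illustration.
minutes = Document("Document 0", Counter({"budget": 2, "approved": 1}))
agenda = Document("Document 1", Counter({"zoning": 3}))
board = SubOrganization("Sub Organization 0", [minutes, agenda])
town = Organization("Organization 0", [board])

results = list(search([town], "budget"))
print(results)
```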
