-
Notifications
You must be signed in to change notification settings - Fork 2
Home
Christoph Kindl edited this page Jan 24, 2015
·
14 revisions
- download and extract the following TAR file into the repo directory: download -- contains the folder structure and files needed by the classifier
- run
mvn compile exec:java -Dexec.mainClass="at.ac.tuwien.infosys.dsg.aic.ws2014.g4.t1.webservice.rest.JettyTwitterSentimentRestService"
to start the webservice - use GUI (gui/index.html) to test the webservice
(see http://help.sentiment140.com/for-students/ for original data)
- 10k entries (5k+,5k-) [download](https://kindl.io/owncloud/public.php? service=files&t=99513d59c214bd98f275e8235c93ae01) (MD5: c1aed1c76841314250997141ca554d6a)
- 20k entries (10k+,10k-) download (MD5: 347156a815170354da0e9da8aee2f544)
- 100k entries (50k+,50k-) download (MD5: 313db80631934d4cb09bffa32031b8f1)
- 200k entries (100k+,100k-) download (MD5: 5b69335910ce5fdad597746d34612b29)
- 500k entries (250k+,250k-) download (MD5: a5bd44a08fa06b2b27222892f23c9b89)
(standard options used -- see WEKA API for classifier implementations mentioned below)
-
10k entries (5k+,5k-):
-
20k entries (10k+,10k-)
-
100k entries (50k+,50k-)
-
200k entries (100k+,100k-)
-
500k entries (250k+,250k-)
- TODO (took too long / too much resources)
-
10k entries (5k+, 5k-)
- IBk: correct: 207 instances (41.5663%); incorrect: 291 instances (58.4337%)
- NaiveBayes: correct: 234 instances (46.988%); incorrect: 264 instances (53.012%)
- SMO: correct: 262 instances (52.6104%); incorrect: 236 instances (47.3896%)
-
20k entries (10k+,10k-)
- IBk: correct: 215 instances (43.1727%); incorrect: 283 instances (56.8273%)
- NaiveBayes: correct: 232 instances (46.5863%); incorrect: 266 instances (53.4137%)
- SMO: correct: 259 instances (52.008%); incorrect: 239 instances (47.992%)
-
100k entries (50k+,50k-)
- IBk: correct: 238 instances (47.7912%); incorrect: 260 instances (52.2088%)
- NaiveBayes: correct: 240 instances (48.1928%); incorrect: 258 instances (51.8072%)
- SMO: correct: 272 instances (54.6185%); incorrect: 226 instances (45.3815%)
-
200k entries (100k+,100k-)
- IBk: correct: 242 instances (48.5944%); incorrect: 256 instances (51.4056%)
- NaiveBayes: correct: 244 instances (48.996%); incorrect: 254 instances (51.004%)
- SMO: correct: 262 instances (52.6104%); incorrect: 236 instances (47.3896%)