Assignment Objective :- Extract the article text from each URL listed in the given Excel sheet, perform sentiment analysis on the extracted text, and compute the required variables using Python.
Folder view -
Project/
├── data/
│   ├── Input.xlsx
│   └── extracted_articles/
├── StopWords/
│   ├── StopWords_Auditor.txt
│   ├── StopWords_Currencies.txt
│   ├── StopWords_DatesandNumbers.txt
│   ├── StopWords_Generic.txt
│   ├── StopWords_GenericLong.txt
│   ├── StopWords_Geographic.txt
│   └── StopWords_Names.txt
├── MasterDictionary/
│   ├── positive-words.txt
│   └── negative-words.txt
└── main_script.py
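As a rough illustration of how the word lists in this layout might be used, the sketch below loads every file under StopWords/ into one set, reads a MasterDictionary/ file into a set, and counts positive and negative matches in a text. It is not taken from main_script.py. The helper names (load_word_set, load_stop_words, count_sentiment_words), the latin-1 encoding, and the idea that each line holds one term optionally followed by "|" and a comment are all assumptions made for this example.

```python
from pathlib import Path
import re


def load_word_set(path, encoding="latin-1"):
    """Read one term per line into a lowercase set (hypothetical helper).

    Assumption: anything after a '|' on a line is a comment and is dropped.
    """
    words = set()
    for line in Path(path).read_text(encoding=encoding).splitlines():
        term = line.split("|", 1)[0].strip().lower()
        if term:
            words.add(term)
    return words


def load_stop_words(folder):
    """Merge every .txt file in the given folder into a single set."""
    stop_words = set()
    for file in sorted(Path(folder).glob("*.txt")):
        stop_words |= load_word_set(file)
    return stop_words


def count_sentiment_words(text, positive, negative, stop_words):
    """Return (positive_count, negative_count) over the non-stop-word tokens."""
    tokens = [t for t in re.findall(r"[a-z']+", text.lower()) if t not in stop_words]
    pos = sum(1 for t in tokens if t in positive)
    neg = sum(1 for t in tokens if t in negative)
    return pos, neg
```

Run from the Project/ folder, load_stop_words("StopWords") and load_word_set("MasterDictionary/positive-words.txt") would produce the sets that count_sentiment_words expects. The variables the assignment actually requires are defined in the separate analysis document, so these raw counts are only a starting point.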
-----readme-----
Steps to Set Up and Run
1. Install Necessary Dependencies
pip install pandas openpyxl nltk requests beautifulsoup4
2. Run the Script
From the Project/ folder, run:
python main_script.py
Dependencies:-
The following libraries are required:
pandas: For reading/writing Excel files and managing DataFrames.
openpyxl: For Excel file operations.
nltk: For text processing (requires additional downloads).
requests: For fetching data from URLs.
beautifulsoup4: For parsing HTML content.
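To show how these libraries might fit together, here is a minimal sketch of the extraction step: read the URLs from Input.xlsx, download each page, keep the title and paragraph text, and save the result under data/extracted_articles/. It is not the code in main_script.py. The column names URL_ID and URL, the use of <p> tags to find the article body, the 30-second timeout, and all function names are assumptions made for this example.

```python
from pathlib import Path

import pandas as pd
import requests
from bs4 import BeautifulSoup


def extract_article_text(html):
    """Return (title, body) from an HTML string.

    Assumption: the article body lives in <p> tags; a real page may need
    a more specific selector.
    """
    soup = BeautifulSoup(html, "html.parser")
    title = soup.title.get_text(strip=True) if soup.title else ""
    paragraphs = [p.get_text(" ", strip=True) for p in soup.find_all("p")]
    body = "\n".join(p for p in paragraphs if p)
    return title, body


def fetch_article(url, timeout=30):
    """Download one URL and return (title, body); raises on HTTP errors."""
    response = requests.get(url, timeout=timeout)
    response.raise_for_status()
    return extract_article_text(response.text)


def extract_all(input_path="data/Input.xlsx", out_dir="data/extracted_articles"):
    """Save each article as <URL_ID>.txt (column names are assumed)."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    rows = pd.read_excel(input_path)  # pandas uses openpyxl for .xlsx files
    for row in rows.itertuples(index=False):
        try:
            title, body = fetch_article(row.URL)
        except requests.RequestException as err:
            # Skip pages that fail to download instead of stopping the run.
            print(f"Skipping {row.URL_ID}: {err}")
            continue
        (out / f"{row.URL_ID}.txt").write_text(f"{title}\n{body}", encoding="utf-8")
```

Keeping the HTML parsing in its own function, separate from the download, makes it possible to check the parsing on a saved page without network access.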
Test Assignment Google Colab link:- https://colab.research.google.com/drive/1Vz5oBqt4PnSeKEwtZH9MxrtPE4Jeina3?usp=sharing
Read the assignment objective carefully before running the script. The text analysis process is described in detail in a separate document.