task flowchart
Start
|
|--> main()
|
|--> Initialize all_links with target URL(s)
|
|--> For each URL in all_links:
|     |
|     |--> get_all_data(url)
|           |
|           |--> get_all_a_tags(url)
|           |     |
|           |     |--> bs4_contents(url)
|           |           |
|           |           |--> Request URL
|           |           |--> Parse HTML content
|           |
|           |--> For each <a> tag:
|                 |
|                 |--> Check if tag has been visited
|                 |     |
|                 |     |--> Construct full URL
|                 |           |
|                 |           |--> Check URL format
|                 |           |--> Append to visited list
|                 |
|                 |--> get_content(full_url)
|                       |
|                       |--> bs4_contents(full_url)
|                       |     |
|                       |     |--> Request URL
|                       |     |--> Parse HTML content
|                       |     |--> Extract text content
|                       |
|                       |--> Save content to content.txt
|
End
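
The flow above could be sketched in Python roughly as follows. This is a minimal illustration, not the repository's actual code: the function names main, get_all_data, get_all_a_tags, bs4_contents and get_content and the file name content.txt come from the flowchart, while fetch_html, the fetch and out_path parameters, and the _PageParser class are hypothetical helpers added for the example. The name bs4_contents suggests the real code uses BeautifulSoup; this sketch swaps in the standard-library html.parser and urllib so it runs without third-party packages. It also assumes "Check URL format" means accepting only http(s) links and that the visited list holds full URLs.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse
from urllib.request import urlopen


class _PageParser(HTMLParser):
    """Hypothetical helper: collects <a href> values and visible text."""

    def __init__(self):
        super().__init__()
        self.hrefs = []
        self.text_parts = []
        self._skip_depth = 0  # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.hrefs.append(href)

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth and data.strip():
            self.text_parts.append(data.strip())


def fetch_html(url):
    """'Request URL' step (assumed helper): download and decode a page."""
    with urlopen(url, timeout=10) as response:
        return response.read().decode("utf-8", errors="replace")


def bs4_contents(url, fetch=fetch_html):
    """Request the URL and parse its HTML (stdlib parser, not BeautifulSoup)."""
    parser = _PageParser()
    parser.feed(fetch(url))
    parser.close()
    return parser


def get_all_a_tags(url, fetch=fetch_html):
    """Return the href of every <a> tag found on the page."""
    return bs4_contents(url, fetch).hrefs


def get_content(full_url, fetch=fetch_html):
    """Return the text content of the page at full_url."""
    return "\n".join(bs4_contents(full_url, fetch).text_parts)


def get_all_data(url, visited, out_path="content.txt", fetch=fetch_html):
    """Follow each link on url once and append its text to out_path."""
    for href in get_all_a_tags(url, fetch):
        full_url = urljoin(url, href)  # construct full URL
        if urlparse(full_url).scheme not in ("http", "https"):
            continue  # check URL format (assumption: http/https only)
        if full_url in visited:
            continue  # already visited
        visited.append(full_url)
        text = get_content(full_url, fetch)
        with open(out_path, "a", encoding="utf-8") as handle:
            handle.write(text + "\n")


def main(all_links, out_path="content.txt", fetch=fetch_html):
    """Process every start URL in all_links; return the visited list."""
    visited = []
    for url in all_links:
        get_all_data(url, visited, out_path, fetch)
    return visited
```

Passing a custom fetch callable (for example, a dictionary lookup over canned HTML) lets the crawl logic be exercised without network access; calling main(["https://example.com/"]) with the default would fetch real pages and append their text to content.txt.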