The Site Scanning program is a website scanner for federal government professionals. Excellent federal websites lead to the delivery of excellent digital services for the public and public servants. This program gives you instant access to data on the big picture of what's working really well on federal websites. Data indicating the most critical web hygiene and user-friendly features present on a federal website. These top features of excellence are distilled from industry standards, federal laws, and user needs.
The core of the program is a scanning engine that performs a series of scans that each check for a particular feature on each individual website included in a continuously-updated list of federal domains and subdomains. The system compiles all the scan results into simple data files called JSON. To access these JSON files, users pull the data into their own system using the Site Scanning API, or by engaging directly on a report page.
The below is assorted answered prompts that we're gathering here both for ease of access, but also to further give context to the program.
The Site Scanning programs provides the only comprehensive index of federal websites, and generates over 100 data points about each one. With this data, GSA, OMB, and other agencies can track what and how many public websites each agency operates and offers qualitative metrics for performance, accessibility, compliance, and more. TTS programs are also able to use Site Scanning data for business intelligence to compute adoption rates and to perform market research.
- 5841 Downloads in FY24
- 21st Century IDEA
- OMB-23-22
- OMB
- GSA
- NASA
- CISA
- NIST
- CIO Council Digital Experience Council Members
- OMB OFCIO
- GSA Digital Council
- GSA/TTS Programs (DAP, USWDS, Touchpoints)
- $420,000
- 1.5 FTE
- Improve user experience of the Site Scanning dataset for stakeholders by improving index completeness and technical documentation.
Deliverable | Significance | Expected Outcome | Risk | Expected Quarter |
---|---|---|---|---|
Expand Website Index | By doubling the number of seed files (12->25), the inventory of websites will become closer to 100% complete. | More websites scanned | n/a | fy25 q2 |
Expand Technical Documentation | Creating an expanded data dictionary and graphical representations to further explain processes will help agencies better understand and use the data. | Agencies better able to self-serve with the data | n/a | fy25 q3 |
Renew ATO | Maintains system operations. | ATO completed and program continues | n/a | fy25 q3 |
- Usage Definition: Site Scanning data downloads
- Hypothetical service offering by a company (doesn't exist though):
- 87 agencies via m-23-22. Another 4 agency stakeholders past those.
- Rough estimate for value of dataset if licensed: $10k/agency/year.
- Rough value generated: $910,000/year
- Agencies stand up and run their own instance of Site Scanning:
- Assume 50% of current users want to continue having this data for their agency
- $60K/year Cloud.gov & $72K/year Software engineering (25% FTE, 40 hrs/month, $150/hr) per agency
- Rough value generated: $5.4M