The first part of the assessment requires a Python script that extracts web links from the Current Estimates web page of the U.S. Census Bureau and outputs those links in a CSV file in an absolute and non-duplicated format. This is the main URL you can navigate from: https://www.census.gov/programs-surveys/popest.html

Make sure that you extract ALL unique URLs and produce a CSV file to include in your project submission. Make sure you handle trailing symbols in your Python script; if you have these URLs, your output is NOT unique: ask.census.gov and ask.census.gov/, or www.commerce.gov and www.commerce.gov/.

All links should be in an absolute format. They may be prefixed by 'http://' or https://, but they do not have to.

As the Current Estimates web site is a living document and subject to change, the number of unique links will vary. Include the output of your Python scraper based on the extraction of links on the day of the submission.

The task asks you to only extract links that point to other HTML pages. Please ensure that links to PDF, Excel or other documents are not included in the output file.

Please provide detailed explanations that relate to the ask prompts and provide the Python code segments that are used to accomplish the actions that are being explained. Explain the use of Python libraries and modules as relevant.

Academic Honesty!
It is not our intention to break the school's academic policy. Posted solutions are meant to be used as a reference and should not be submitted as is. We are not held liable for any misuse of the solutions. Please see the frequently asked questions page for further questions and inquiries.
Kindly complete the form. Please provide a valid email address and we will get back to you within 24 hours. Payment is through PayPal, Buy me a Coffee or Cryptocurrency. We are a nonprofit organization however we need funds to keep this organization operating and to be able to complete our research and development projects.