Project Description
I need a small Python utility that can visit a couple of CAD-focused websites and pull down detailed product information in a single pass. This is a one-off job, so I’m not looking for scheduling or cron integration, just a clean script I can rerun locally whenever I want.
For each product I need to capture the title, part number, main specs, and any listed price or download link. Most pages are straightforward HTML, but a few rely on lazy-loaded sections, so the solution may call for Requests + BeautifulSoup in the simple cases and Selenium or Playwright where JavaScript gets in the way. I’ll provide the shortlist of URLs once we start.
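As a rough illustration of the "simple fetch first, browser only when needed" idea, here is a minimal sketch. It assumes each product section can be recognised by a CSS class; the class names, function names, and timeouts are all hypothetical placeholders until the real URLs are known. The check uses the standard-library HTML parser for brevity (BeautifulSoup would do the actual field extraction), and Requests and Playwright are imported lazily, so the browser dependency is only needed for pages that require it.

```python
"""Sketch: try a plain HTTP fetch first, fall back to a browser if needed."""
from html.parser import HTMLParser


class _ClassCollector(HTMLParser):
    """Collects every CSS class name that appears in a document."""

    def __init__(self):
        super().__init__()
        self.classes = set()

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name == "class" and value:
                self.classes.update(value.split())


def missing_sections(html, required_classes):
    """Return the required CSS classes that are absent from the HTML."""
    collector = _ClassCollector()
    collector.feed(html)
    return [c for c in required_classes if c not in collector.classes]


def fetch_static(url, timeout=20):
    """Plain HTTP fetch (imported lazily so the import stays optional)."""
    import requests

    response = requests.get(url, timeout=timeout)
    response.raise_for_status()
    return response.text


def fetch_rendered(url, timeout_ms=30000):
    """Render the page in headless Chromium so lazy-loaded parts appear."""
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch(headless=True)
        try:
            page = browser.new_page()
            page.goto(url, wait_until="networkidle", timeout=timeout_ms)
            return page.content()
        finally:
            browser.close()


def fetch_html(url, required_classes):
    """Use the cheap fetch unless expected sections are missing from it."""
    html = fetch_static(url)
    if missing_sections(html, required_classes):
        html = fetch_rendered(url)
    return html
```

A call such as `fetch_html(url, ["product-title", "spec-table"])` would then only start a browser for pages whose static HTML lacks one of the (assumed) section classes.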
Please write clear, well-commented code, keep third-party dependencies to a minimum, and organise output in both CSV and JSON formats.
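For the dual output, something along these lines would be enough, using only the standard library. This is a sketch under assumptions: the field names (`part_number`, `download_url`, and so on), the file names, and the choice to flatten specs into a single `key=value` string for the CSV are placeholders, not requirements.

```python
import csv
import json
from pathlib import Path

# Hypothetical column set; adjust once the real product fields are known.
FIELDS = ["title", "part_number", "specs", "price", "download_url"]


def write_outputs(records, out_dir):
    """Write the same records to products.json and products.csv."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    json_path = out_dir / "products.json"
    csv_path = out_dir / "products.csv"

    # JSON keeps nested structures (e.g. a specs dict) as they are.
    with json_path.open("w", encoding="utf-8") as fh:
        json.dump(records, fh, ensure_ascii=False, indent=2)

    # CSV is flat, so a specs dict is joined into one "key=value; ..." cell.
    with csv_path.open("w", encoding="utf-8", newline="") as fh:
        writer = csv.DictWriter(fh, fieldnames=FIELDS)
        writer.writeheader()
        for record in records:
            row = {key: record.get(key, "") for key in FIELDS}
            if isinstance(row["specs"], dict):
                row["specs"] = "; ".join(
                    f"{k}={v}" for k, v in row["specs"].items()
                )
            writer.writerow(row)

    return csv_path, json_path
```

Missing fields, such as a product with no listed price, end up as empty cells in the CSV rather than raising an error.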
Deliverables:
• Python script(s) with all parsing logic
• requirements.txt for easy setup
• README with run instructions and example command
• The scraped dataset for the sample run
If anything about the target sites needs clarification, let me know up front so we can keep iterations minimal.