Daily MCA Company Data Scraper


Pending

💰 INR 5000–20000 👤 Unknown 🕒 24d ago status: new

Required Skills

Python, Data Processing, Web Scraping, Software Architecture, Data Mining, Data Extraction, BeautifulSoup, Selenium

Project Description

Each morning I need a fresh Excel workbook that lists every company incorporated the previous day on mca.gov.in. The file must hold the company’s name, registered address, mobile number, email ID, date of incorporation, director’s name, and company registration number, all captured in a single automated sweep of the site.

Workflow I picture

• A script or small service (Python + BeautifulSoup/Selenium, Node, or any stable stack you prefer) pings the MCA portal once every 24 hours, pulls the latest incorporation feed, parses the required fields, and writes them to a clean XLSX sheet.
• The script then places the file at a predictable URL and returns a direct download link; no email dispatch is needed.
• Duplicate or previously fetched records are skipped so each daily sheet contains only that day’s incorporations.
• Any parsing failures or site changes should trigger a simple log so I can track issues.

Acceptance criteria

• Daily job completes successfully for 7 consecutive days on a test run.
• Columns appear in the exact order and naming provided above.
• Download link is accessible without manual login.
• Solution is handed over with source code, setup instructions, and a one-click way to redeploy on a different server if required.

If an API key, CAPTCHA bypass, or headless browser is required, please factor it in; I’m fine with open-source libraries as long as licensing is clear.
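The de-duplication, fixed column order, and failure logging described above could be sketched roughly as follows. This is a minimal illustration and not a finished scraper. It assumes the scraping step already yields one dict per company. The header strings in `COLUMNS`, the JSON file of previously exported registration numbers, and the function names `load_seen`, `save_seen`, and `build_rows` are all hypothetical. How records are actually fetched from the MCA portal is outside this sketch.

```python
import json
import logging
from pathlib import Path

# Columns in the order listed in the brief. The exact header wording is an
# assumption; adjust it to whatever naming the client confirms.
COLUMNS = [
    "Company Name",
    "Registered Address",
    "Mobile Number",
    "Email ID",
    "Date of Incorporation",
    "Director Name",
    "Company Registration Number",
]
KEY = "Company Registration Number"  # assumed to identify a company uniquely

log = logging.getLogger("mca_daily")  # hypothetical logger name


def load_seen(path):
    """Return the set of registration numbers exported on earlier runs."""
    p = Path(path)
    if not p.exists():
        return set()
    return set(json.loads(p.read_text(encoding="utf-8")))


def save_seen(path, seen):
    """Persist the set of exported registration numbers as a JSON list."""
    Path(path).write_text(json.dumps(sorted(seen)), encoding="utf-8")


def build_rows(records, seen):
    """Return rows ordered by COLUMNS for records not exported before.

    Records without a registration number are logged and skipped.
    `seen` is updated in place with every number that is emitted.
    """
    rows = []
    for rec in records:
        reg_no = (rec.get(KEY) or "").strip()
        if not reg_no:
            log.warning("parse failure, no registration number: %r", rec)
            continue
        if reg_no in seen:
            continue  # already exported, or repeated within this batch
        seen.add(reg_no)
        rows.append([rec.get(col, "") for col in COLUMNS])
    return rows
```

Each returned row is a plain list, so it could be handed to any XLSX writer; with openpyxl, for instance, writing `COLUMNS` as the header and then calling `Worksheet.append(row)` for each row would preserve the column order. Calling `save_seen` only after the workbook has been written successfully keeps a failed run from marking records as exported.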
