Project Description
Italian Long Document Sourcing - (AI Training Project)
Summary
We are seeking detail-oriented freelancers to support a large-scale data sourcing project focused on training advanced AI systems. This project involves sourcing high-quality long-form documents in Italian across multiple domains and categories.
Project Scope
Total Documents Required: 140
Coverage: 17 domains and 140 fine-grained categories
Requirement: 1 document per category
Document Length: Minimum 40 pages, Maximum 100 pages
Key Responsibilities
Ensure all documents are real-world data only (no synthetic or AI-generated content), created within the last 10 years, and relevant to the assigned domain and category. Maintain high-quality structure, layout, and formatting, and strictly follow all provided sourcing guidelines.
Mandatory Requirements
No duplicate templates — each of the 140 documents must follow a unique structure/template. Documents must not be sourced from public benchmark datasets. Only genuine, real-world documents will be accepted.
Compensation & Candidate Profile
Each approved submission will be paid at a fixed rate of $40 per document. Candidates with familiarity in Italian document formats and structures are preferred. Prior experience in data sourcing, data entry, document annotation, or AI training datasets is a plus but not mandatory.
Additional Information
This is a recurring opportunity, with ongoing batches available based on the quality and consistency of submissions. Only guideline-compliant submissions will be approved.