Project Overview
The Website Proof Reader is a standalone AI-based tool designed for internal quality control. The tool will crawl websites we have designed, proofread the content for spelling and grammar according to the UK English Dictionary, and generate a report. It will also identify and flag specified ‘Caution words’ in the content. The ‘Caution words’ will be provided as a list in the system. The reports will be formatted as an easy-to-follow Word document with company branding at the top and will contain tick boxes for easy sharing and internal use. For each issue, the report will give the URL and the sentence containing the spelling error, grammatical error or caution word.
Key Features
Website Content Crawling
Input a website URL.
Crawl and extract all text content from the site.
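As a rough illustration of the extraction step, the sketch below pulls the visible text out of one page's HTML using only the Python standard library. The names TextExtractor and extract_text are hypothetical, and fetching the pages and following links is assumed to happen elsewhere.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text and ignores script, style and noscript contents."""

    SKIPPED_TAGS = {"script", "style", "noscript"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0  # greater than zero while inside a skipped tag
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and self._skip_depth == 0:
            self.chunks.append(text)


def extract_text(html):
    """Return the page's visible text as one space-separated string."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)
```

A production crawler would probably also need to handle pages that render their content with JavaScript, which this sketch does not attempt.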
Proofreading
Check for spelling and grammar errors according to the UK English Dictionary.
(E.g. ‘optimise’ (UK) vs ‘optimize’ (US).)
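To make the UK vs US distinction concrete, here is a minimal sketch that flags US spellings and suggests the UK form. The US_TO_UK mapping is a tiny hypothetical sample, not a real dictionary, and find_us_spellings is an illustrative name; a full solution would need a complete UK English word list and a separate grammar check.

```python
import re

# Hypothetical sample mapping; a real tool would need a full UK English dictionary.
US_TO_UK = {
    "optimize": "optimise",
    "color": "colour",
    "center": "centre",
}

WORD_RE = re.compile(r"[A-Za-z]+")


def find_us_spellings(text):
    """Return (found_word, suggested_uk_spelling) pairs in order of appearance."""
    findings = []
    for match in WORD_RE.finditer(text):
        word = match.group(0)
        suggestion = US_TO_UK.get(word.lower())
        if suggestion:
            findings.append((word, suggestion))
    return findings
```

For example, find_us_spellings("We optimize the color scheme.") would flag ‘optimize’ and ‘color’, while text already in UK spelling produces no findings.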
Report Generation
Generate a Word document report.
Include company header and tick boxes.
List spelling and grammar errors with corrections as well as caution word highlights.
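The layout of a report entry could look something like the sketch below. It only builds plain text, with "[ ]" standing in for a tick box; producing the actual Word document with branding would need a document library and is outside this sketch. Finding and format_report are hypothetical names.

```python
from dataclasses import dataclass


@dataclass
class Finding:
    url: str
    sentence: str
    kind: str  # "spelling", "grammar" or "caution"
    suggestion: str = ""  # empty for caution words


def format_report(company_name, findings):
    """Build a numbered plain-text report with a tick-box placeholder per entry."""
    lines = [company_name, "Website Proof Reader Report", ""]
    for number, finding in enumerate(findings, start=1):
        lines.append(f"{number}. [ ] {finding.kind.capitalize()}: {finding.sentence}")
        lines.append(f"   URL: {finding.url}")
        if finding.suggestion:
            lines.append(f"   Suggested correction: {finding.suggestion}")
    return "\n".join(lines)
```

Keeping each finding as a small record like this means the same data could feed either a plain-text preview or the final Word output.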
Caution Words Detection
Input a list of caution words (e.g., ‘Best’, ‘Finest’).
Identify and list occurrences of these words.
Provide the exact URL and sentence where each caution word is found.
Present this in a numbered list in the report.
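The caution word steps above can be sketched as follows, assuming the page text has already been extracted. The sentence splitting is deliberately naive (it breaks on ., ! or ? followed by whitespace), and split_sentences and find_caution_words are hypothetical names.

```python
import re


def split_sentences(text):
    """Naive sentence split on ., ! or ? followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def find_caution_words(url, text, caution_words):
    """Return one record per whole-word, case-insensitive caution word match."""
    if not caution_words:
        return []
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(word) for word in caution_words) + r")\b",
        re.IGNORECASE,
    )
    hits = []
    for sentence in split_sentences(text):
        for match in pattern.finditer(sentence):
            hits.append({"url": url, "word": match.group(0), "sentence": sentence})
    return hits
```

Matching on word boundaries means ‘Best’ is flagged in "the best sites" but not inside a longer word such as "bestseller".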
Website Access
1) We should be able to paste in a URL. The tool must be able to scan live sites where source files are not available.
2) Source files: If you feel it will be more beneficial, you can install the script in the folder on our server where 95% of the sites we need to check are available in beta stages before they go live. Most of our sites follow this format: https://domain/projects/variable-client-site/.
3) Document upload option: where the website isn’t live and we only have a Word document, we should be able to upload that as well to generate a report.
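Because beta sites sit in their own folder under a shared domain, a crawler would likely need to stay inside the folder it was given. The sketch below shows one way to check that, using the standard library only; in_scope is a hypothetical name, and example.com stands in for the real domain.

```python
from urllib.parse import urljoin, urlparse


def in_scope(start_url, link):
    """True if link, resolved against start_url, stays on the same host and folder."""
    start = urlparse(start_url)
    target = urlparse(urljoin(start_url, link))
    # Treat the start path as a folder so sibling projects are excluded.
    base_path = start.path if start.path.endswith("/") else start.path + "/"
    same_folder = target.path.startswith(base_path) or target.path + "/" == base_path
    return (
        target.scheme in ("http", "https")
        and target.netloc == start.netloc
        and same_folder
    )
```

With a start URL of https://example.com/projects/client-site/, a relative link such as about.html is in scope, while /projects/other-site/ and links to other hosts are not.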
Technical Requirements
Web Crawling
Proofreading
Grammar Check
Report Generation
Caution Words Detection
Allow input of custom caution words.
Implement text search functionality to identify and list caution words.
Document Upload
User Interface
Simple and intuitive interface for inputting URLs and uploading documents.
Option to input custom caution words.
Downloadable Word document reports with company branding.
Implementation Plan
Requirement Analysis
Finalise feature list and technical specifications.
Gather all necessary inputs such as caution words and company branding details.
Development
Ideally, deliver this using OpenAI.
Set up web crawling and content extraction.
Integrate proofreading engine with UK English Dictionary.
Develop report generation module.
Implement caution words detection.
Add source file access and document upload functionalities.
Testing
Perform thorough testing on various websites and document formats.
Ensure accuracy of proofreading and caution word detection.
Validate the report format and content.
Deployment
Deploy the tool as a standalone application.
Provide documentation and training for internal use.
Use Case
Internal Use for Quality Control: The tool will be used by web design companies to ensure high-quality content on their websites, both in the beta stage and for live sites.
Development Approach
Standalone Tool: The tool will be developed as a standalone application without integration into a larger project or system.
Budget: $1,000
Posted On: July 28, 2024 11:21 UTC
Category: AI Integration
Skills: Artificial Intelligence
Country: United Kingdom
