AI and Machine Learning Developer
Client – Manufacturing
Contract
Outside IR35
2 month contract (Potential for more work)
Rate – £600
Remote Working (Potential trip to Mansfield)
ASAP Start Date
The Role – We are seeking an experienced AI and Machine Learning Contractor to work with our client, a leading manufacturing company, to develop a scalable system on Microsoft Azure. This system will parse large PDF documents, validate the contents against an established rule set, and provide actionable insights.
The ideal candidate has a strong background in AI, machine learning, and NLP (Natural Language Processing), and experience with cloud solutions on Azure. You will be working with the Solutions Architect on implementing the application based on an existing architecture blueprint and business rules, solidifying the solution design. During the prototyping phase, a decision will be made between using an LLM and a more custom ML approach for NLP.
Key Responsibilities
System Architecture and Design
- Define the technology stack, tools, and libraries necessary for the validation processes.
- Ensure that the system meets both functional and non-functional requirements.
PDF Parsing and Text Extraction
- Develop a PDF text extraction module using Python or relevant libraries (e.g., PyMuPDF).
- Ensure accuracy in extracting text from various document structures, including tables, images, and multi-column layouts.
Rule-Based Content Validation
- Design and implement a rules engine for content validation, capable of cross-referencing extracted text against predefined rules.
- Collaborate with subject matter experts to translate business rules into machine-readable formats.
Natural Language Processing and LLM Integration
- Integrate LLMs (Large Language Models) to enhance understanding and interpretation of extracted text for nuanced rule validation.
- Leverage Azure OpenAI or other suitable LLM services to implement content parsing and language-based validation.
- Fine-tune NLP models where necessary to align with industry-specific terminology and the client’s validation criteria.
Data Processing and Workflow Automation
- Develop an automated pipeline on Azure for end-to-end document processing, from text extraction to validation and report generation.
- Utilize Azure services such as Azure Logic Apps, Azure Functions, and Azure Data Factory for workflow automation and seamless integration.
Data Security and Compliance
- Implement best practices for data privacy, security, and compliance, particularly around handling sensitive documents.
Performance Optimization and Monitoring
- Optimize algorithms and processing pipelines for quick turnaround.
- Set up monitoring tools and dashboards on Azure to track application health.
Required Skills and Qualifications
Technical Skills
- Machine Learning/NLP: Experience in AI and machine learning techniques, especially NLP, using libraries such as TensorFlow, PyTorch, spaCy, and Hugging Face.
- PDF Parsing: Hands-on experience with Python libraries for PDF extraction (e.g., PyMuPDF, pdfminer).
- Azure Cloud: Strong experience with Azure services, including Azure OpenAI, Azure Functions, Azure Logic Apps, Azure Data Factory, and Azure Cognitive Serv