£50 Per hour
Undetermined
Remote
London Area, United Kingdom
Summary: The Automation Engineer (Python) will lead a text and table extraction project for a sustainability data platform utilized by global financial institutions. This role involves evaluating extraction tools, building automated pipelines, and deploying a system to convert raw documents into structured outputs. The position is remote, with a contract duration of three months and potential for extension. The ideal candidate will possess strong Python skills and experience in automated workflows and extraction tools.
Key Responsibilities:
- Designing and delivering an end-to-end extraction pipeline
- Comparing tools and libraries for parsing PDFs, Word docs, and more
- Automating validation and fallbacks for edge cases
- Collaborating with a small, agile engineering team
Key Skills:
- Strong Python skills and experience building automated workflows
- Hands-on knowledge of text and table extraction tools and libraries
- Ability to evaluate and communicate trade-offs between technical approaches
- Experience designing and implementing validation or human-in-the-loop systems
- Self-sufficient and able to deliver production-ready work with minimal oversight
Salary (Rate): £50.00/hr
City: London Area
Country: United Kingdom
Working Arrangements: remote
IR35 Status: undetermined
Seniority Level: undetermined
Industry: Other
Automation Engineer (Python) - Text & Table Extraction - 3 month contract
We’re hiring a Python engineer to lead a text and table extraction project for a sustainability data platform used by global financial institutions. You’ll evaluate cutting-edge extraction tools, build automated pipelines, and deploy a production-ready system to convert raw documents into structured, machine-readable outputs.
Key Details
- Industry: ESG / Financial Data
- Working model: Remote (UK preferred)
- Contract: 3 months (with potential extension)
- Tech: Python, pdfplumber, Tika, GROBID, etc.
- Start: ASAP
What You’ll Be Working On
- Designing and delivering an end-to-end extraction pipeline
- Comparing tools and libraries for parsing PDFs, Word docs, and more
- Automating validation and fallbacks for edge cases
- Collaborating with a small, agile engineering team
Requirements
- Strong Python skills and experience building automated workflows
- Hands-on knowledge of text and table extraction tools and libraries
- Ability to evaluate and communicate trade-offs between technical approaches
- Experience designing and implementing validation or human-in-the-loop systems
- Self-sufficient and able to deliver production-ready work with minimal oversight
This is a focused, impactful role for someone who wants to solve real-world problems and build something that matters.