Looking for an experienced data/ML enginner to help scrape data and text from webpages and unstructured sources (mainly PDF documents). The ideal candidate would be able to put in place a data architecture that can be regularly updated afterwards.
Required experience:
* Web scraping and data extraction
* Data engineering and databases
* Cloud - AWS/GCP
Preferred experience:
* Experience in using LLMs to capture data in a structured format
* LLM prompt engineering
This job is already closed and no longer accepting applicants, sorry.