We are developing an AI copilot that enables attorneys to generate large contracts (50-70+ pages) with a few clicks of a button. We are looking for an experienced AI/ML/NLP developer for a 3-6 month project. You will work with a team of international experts. The ideal candidate will have experience with scraping data and working with language models / natural language processing concepts.
We currently have a web-based prototype that demonstrates core functionality: a user inputs a short text which is embedded. The embedding is compared (using a similarity search) against a database of contract clauses, the results of which are presented in an index. The user selects the contract clause. Our backend sends the two texts to a large language model and returns an enhanced text.
We are looking to develop an MVP based on our prototype. This will require the development of the following:
- A scraper capable of running a script or bot that crawls the internet, reads and extracts documents (large contracts), and saves the data as html and metadata.
- Text processing that converts the html into fully-formatted and structured .doc files, splits the text into smaller semantic units (as well as reconstitutes them), creates embeddings that represent the text as vectors, and extracts/masks/removes entities (names, addresses, amounts, etc.).
- An information retrieval system based on a semantic similarity search.
- A document workflow interface with a tree-like flow where a user can make selections (by inserting text, ticking boxes, etc.) that drive outcomes i.e. the workflow collects data by opening up branches / nodes using conditional logic.
- A Microsoft Word plug-in that sends user selections (made in the workflow) to an LLM/server via an API, the LLM/server generates text that it sends back to the plug-in, and the plug-in adds that text to a document in track changes.
- A backend that takes userâs selections/inputs (via the document workflow), generates hypothetical text based on a simple language model, sends that text to the LLM/server with a prompt.
We understand that no one developer may be capable of all of this, so we are open to receiving proposals for one or more aspects of this project.
To apply for this job, please submit a proposal detailing your experience and expertise in MVP development, specifically for the aspect for which you wish to be considered.
Please include links to past projects that demonstrate your ability to handle similar tasks.
This job is already closed and no longer accepting applicants, sorry.