https://prodi.gy
Prodigy · An annotation tool for AI, Machine Learning & NLP
A downloadable annotation tool for LLMs, NLP and computer vision tasks such as named entity recognition, text classification, object detection, image segmentation, evaluation and more.
Prodigy · An annotation tool for AI, Machine Learning & NLP 🚀Out now: v1.18Get ProdigyDocumentationFeaturesInformation Extraction Get structured data from textLanguage Model Training Train and fine-tune modelsComputer Vision Classify and segment imagesAudio & Video Classify and segment AV dataPrompt Engineering Develop better LLM promptsCustom Workflows Fully customize your experienceIndustriesBanking & Finance Financial data, risk assessment and market analysisHealthcare & Biomedical Medical documents, clinical notes and pharmaceutical dataMedia & Content Creation Multi-media data, archives and user-generated contentLegal & Insurance Legislative text, contracts, regulation and complianceConversation & Insights Dialog systems, user interactions and actionable insightsResearch & Education Academic papers, corpus creation and educational dataDemoBlogSupportSearchBuild AI systems that do exactly what you wantA modern data development experience from the makers of prodigyner.llm.correctnews_articles./config.cfg./news.jsonlThis live demo requires JavaScript to be enabled.Kim MillerDeveloperGPT-4APIEfficiently define, train and evaluateProdigy is an extensible annotation tool that gives you a new way to build custom AI systems. Define your classification scheme with real-world examples rather than just prompts, and let powerful models assist – no machine learning experience required.prodigytrain./information_extraction--ner news_ner--textcat news_textcat=========== Training pipeline ===========48% | ████████████████Information ExtractionType modelSize 436 MBSpeed 2k words/secondAccuracy 92.84 F-scoreDirectory ~/data/modelsAlex SmithEngineerTake back controlProdigy runs entirely under your control, making it suitable for even the strictest privacy requirements. You can download it and run it locally right out of the box, or adapt it to serve your infrastructure needs. The models you produce are yours as well, with absolutely no lock-in. recipe.py@prodigy.recipe( "my_custom_recipe" dataset=Arg(help="dataset to save answers to"), source=Arg("--source", help="data to load"), label=Arg("--label", "-l", help="comma-separated label(s)"),)def recipe(dataset: str, source: str, dataset: List[str]): ...Terminalprodigymy_custom_recipeannotations./samples.jsonl--label PERSON,PRODUCTJamie DavisDeveloperBuilt for customization and extensionProdigy lets you define fully custom data feeds and interfaces, letting the computer work instead of the human. By breaking down tasks into smaller pieces and automating whatever you can, you can make annotation over 10× as efficient. DocumentationDownloadable developer tool and libraryCreate, review and train from your annotationsRuns entirely on your own machinesPowerful built-in workflowsRead more PricingLifetime license, pay once, use foreverFlexible options for individuals and teamsFull privacy, no data leaves your serversDownload and install like any other libraryGet ProdigyReal-world case studiesHow S&P Global makes markets more transparent with spaCy and Prodigy in a high-security environmentHow the Guardian approaches quote extraction from news articles with spaCy and ProdigyHow Nesta processes 7m job ads to shed light on the UK’s labor market with spaCy and ProdigyHow Love Without Sound helps music industry law firms recover millions with spaCy and ProdigyHow Posh deploys a customized Prodigy cloud service to build financial chatbots for banking conversationsWhat others sayChristopher EwenSenior Product ManagerHaving a small model makes it much easier to achieve our strict inference SLAs. The system is much less operationally complex because the model is so efficient. Prodigy lets us automate as much as possible and focus on valuable decisions and less clicking.Andy HaltermanResearcherA lack of labeled data held geoparsing back for years. It took a week to fix that with Prodigy.India KerleData ScientistOur current work on measuring the greenness of jobs at the skill-, occupation- and industry-level relied heavily on Prodigy’s flexible custom recipes to incorporate Large Language Models (LLMs) in the labeling process.Anna VissensLead Data ScientistThe principle of human-in-the-loop machine learning is everywhere in journalism. For our AI projects, our data science team developed a fully customized hybrid rules and model-based annotation workflow with Prodigy.Cheyanne BairdNLP Research ScientistProdigy’s design aspect was key. [With my previous annotation tools], I would get a lot of feedback from annotators, saying “it’s really hard, because I have to scroll and scroll and scroll to see the labels. There’s too many labels. There’s too many options.” When I was looking at Prodigy I liked it because you could customize it.Raphael CohenHead of ResearchProdigy is by far the best ROI we had on any tool!Daniel BourkeFounderWe love Prodigy! I've tried many data labelling tools and chose Prodigy specifically for the simplicity. Image folder plus text file to database is perfect for our needs. If a model is one of our main products, good data is basically the same as good code.Antonio Polo de AlvaradoML EngineerI have been working with Prodigy these last few weeks and I can say that it is probably (if not the best) one of the best NLP tools.Rebecca BilbroFounder & CTOProdigy’s interface is incredibly intuitive! It elevates data labeling to a first-order concern in the ML workflow, enables us to collaborate on measures of inter-rater reliability and makes the labeling options super unambiguous for data annotators.User Survey ParticipantML EngineerI really love being able to do almost everything in Python, it means that team members with no front end experience can create tasks super easily.Jordan DavisFounderWhat I love about Prodigy is that it makes it really easy to try out ideas. You often don't know whether something works until you try it. Prodigy lets me iterate on my label schemes and definitions, and build much better models this way.Frequently Asked QuestionsAny other questions that are not covered here? Email us! What makes Prodigy different from other annotation solutions?Prodigy is a downloadable developer tool for creating training and evaluation data for machine learning models. You can use Prodigy to build custom AI systems specific to your use case that you can own and control. Prodigy is a Python package and library that includes a web application. You can customize Prodigy with your own Python functions, and mix and match frontend components to make your own annotation experience.Prodigy integrates tighly with spaCy, but can also be used with any other libraries and tools. The library includes a range of pre-built workflows and command-line commands for various common tasks, and components for implementing your own workflow scripts. Your scripts can specify how the data is loaded and saved and even define custom HTML and JavaScript. The web application is optimized for fast, intuitive and efficient annotation. Is our data really private? How does it work?Prodigy runs entirely on your own machines and never “phones home” or connects to our or any third-party servers. Once installed, you can even operate it on an entirely air-gapped machine without internet connection. All data and models you use and create stay entirely private and under your control. Which models can I use and train with Prodigy?Prodigy lets you train any models you can train in Python. It comes with first-class support for our NLP library spaCy via the built-in train recipe, as well as plugins for using and training Hugging Face models. It also integrates with the major Large Language Model (LLM) API providers out-of-the-box.All data you create is accessible via a convenient Python API and command-line interface, making it easy to implement training for custom models with standard libraries like PyTorch or TensorFlow, both in the cloud, as well as in local setups or environments like Jupyter. How customizable are Prodigy’s workflows and interfaces?Prodigy allows for extensive customization. A range of built-in settings makes it easy for non-experts to customize the experience, and the developer API and SDK lets you integrate the tool into your existing workflows and build powerful extensions for custom use cases.At the core of Prodigy’s developer experience are "recipes", Python functions that describe a workflow. Recipes can implement custom data processing and model training logic, integrate with third-party or internal libraries and tools and provide reusable workflows for your team that can be run without requiring programming or machine learning expertise. Prodigy also allows combining interfaces to build fully custom solutions, as well as implementing your own interactive interfaces with HTML, CSS and JavaScript. What expertise does my team need to use Prodigy?Prodigy is designed as a developer tool and assumes basic familiarity with the Python programming language and the command line. We also provide extensive documentation and examples to help you get started. Once you’ve set up an annotation task, the web application makes it easy for anyone to create annotations, no programming experience required. Which cloud providers does Prodigy support?Prodigy provides a standard Python web server and application that can be deployed on any cloud provider of your choice, including entirely on-premise. You can read more about deployment options and instructions here. Do you have special offers for researchers and universities?We’re always happy to support research projects, and researchers at degree-granting academic institutions can apply for an interim license to use Prodigy for free. To claim your research license, email us and include your university details.NavigationDocumentationGet ProdigyLive DemoSupport ForumCustom SolutionsBlogFeaturesInformation ExtractionLanguage Model TrainingComputer VisionAudio & VideoPrompt EngineeringCustom WorkflowsIndustriesBanking & FinanceHealthcare & BiomedicalMedia & Content CreationLegal & InsuranceConversation & InsightsResearch & EducationStatusTerms & ConditionsLegal & Imprint© 2017-2026 Explosion
1773993932
https://prodi.gy