Stars
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
DSPy: The framework for programming—not prompting—language models
Convert PDF to markdown + JSON quickly with high accuracy
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Automate browser based workflows with AI
OCR, layout analysis, reading order, table recognition in 90+ languages
State-of-the-Art Text Embeddings
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
GenAI Agent Framework, the Pydantic way
Nuitka is a Python compiler written in Python. It's fully compatible with Python 2.6, 2.7, 3.4-3.13. You feed it your Python app, it does a lot of clever things, and spits out an executable or exte…
Advanced Python Mastery (course by @dabeaz)
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks.This framework enables Claud…
The power of Claude Code / GeminiCLI / CodexCLI + [Gemini / OpenAI / OpenRouter / Azure / Grok / Ollama / Custom Model / All Of The Above] working as one.
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
A python library for user-friendly forecasting and anomaly detection on time series.
The property-based testing library for Python
Customisable coding font with alternates, ligatures and contextual positioning. Crazy crisp at 12px/9pt. http://larsenwork.com/monoid/
🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library. Nothing like Michael Bolton.
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
a state-of-the-art-level open visual language model | 多模态预训练模型
Pydoll is a library for automating chromium-based browsers without a WebDriver, offering realistic interactions.
FastStream is a powerful and easy-to-use asynchronous Python framework for building asynchronous services interacting with event streams such as Apache Kafka, RabbitMQ, NATS and Redis.
Merlion: A Machine Learning Framework for Time Series Intelligence
NeuralProphet: A simple forecasting package