Explosion

Software company specializing in developer tools and tailored solutions for AI and Natural Language Processing

Packages on Leaderboard (12)

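| Rank | Package | Downloads | Stars | Language |
| --- | --- | --- | --- | --- |
| 839 | spacy | 18,353,578 | 33,259 | Python |
| 844 | murmurhash | 18,109,276 | 45 | C++ |
| 869 | cymem | 17,419,125 | 459 | Cython |
| 871 | blis | 17,401,229 | 234 | C |
| 876 | catalogue | 17,289,513 | 181 | Python |
| 881 | preshed | 17,185,135 | 87 | Cython |
| 888 | thinc | 16,831,360 | 2,891 | Python |
| 889 | srsly | 16,818,284 | 481 | Python |
| 897 | wasabi | 16,553,610 | 469 | Python |
| 917 | spacy-loggers | 15,957,081 | 12 | Python |
| 947 | confection | 15,288,955 | 193 | Python |
| 1047 | weasel | 12,535,878 | 91 | Python |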

Top GitHub repositories

| Repository | Description | Stars | Language |
| --- | --- | --- | --- |
| explosion/spacy-course | 👩‍🏫 Advanced NLP with spaCy: A free online course | 2,410 | Python |
| explosion/spacy-models | 💫 Models for the spaCy Natural Language Processing (NLP) library | 1,844 | Python |
| explosion/sense2vec | 🦆 Contextually-keyed word vectors | 1,673 | Python |
| explosion/projects | 🪐 End-to-end NLP workflows from prototype to production | 1,420 | Python |
| explosion/spacy-transformers | 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy | 1,402 | Python |
| explosion/spacy-llm | 🦙 Integrating LLMs into structured NLP pipelines | 1,365 | Python |
| explosion/curated-transformers | 🤖 A PyTorch library of curated Transformer models and their composable components | 894 | Python |
| explosion/spacy-layout | 📚 Process PDFs, Word documents and more with spaCy | 861 | Python |
| explosion/spacy-streamlit | 👑 spaCy building blocks and visualizers for Streamlit apps | 854 | Python |
| explosion/spacy-stanza | 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy | 746 | Python |
| explosion/prodigy-recipes | 🍳 Recipes for Prodigy, our fully scriptable annotation tool | 504 | Jupyter Notebook |
| explosion/displacy | :boom: displaCy.js: An open-source NLP visualiser for the modern web | 345 | JavaScript |
| explosion/floret | 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy | 335 | C++ |
| explosion/prodigy-openai-recipes | ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3 | 323 | Python |
| explosion/lightnet | 🌓 Bringing pjreddie's DarkNet out of the shadows #yolo | 320 | C |
| explosion/spacy-notebooks | 💫 Jupyter notebooks for spaCy examples and tutorials | 288 | Jupyter Notebook |
| explosion/spacy-services | 💫 REST microservices for various spaCy-related tasks | 241 | Python |
| explosion/displacy-ent | :boom: displaCy-ent.js: An open-source named entity visualiser for the modern web | 200 | CSS |
| explosion/tokenizations | Robust and fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/ | 194 | Rust |
| explosion/jupyterlab-prodigy | 🧬 A JupyterLab extension for annotating data with Prodigy | 189 | TypeScript |
| explosion/spacymoji | 💙 Emoji handling and meta data for spaCy with custom extension attributes | 183 | Python |
| explosion/wheelwright | 🎡 Automated build repo for Python wheels and source packages | 175 | Python |
| explosion/spacy-dev-resources | 💫 Scripts, tools and resources for developing spaCy | 126 | Python |
| explosion/spacy-lookups-data | 📂 Additional lookup tables and data resources for spaCy | 113 | Python |
| explosion/radicli | 🕊️ Radically lightweight command-line interfaces | 109 | Python |
| explosion/spacy-experimental | 🧪 Cutting-edge experimental spaCy components and features | 105 | Python |
| explosion/thinc-apple-ops | 🍏 Make Thinc faster on macOS by calling into Apple's native Accelerate library | 102 | Cython |
| explosion/talks | 💥 Browser-based slides or PDFs of our talks and presentations | 94 | JavaScript |
| explosion/healthsea | Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health. | 92 | Python |
| explosion/spacy-pkuseg | The pkuseg toolkit for multi-domain Chinese word segmentation | 69 | Python |
| explosion/spacy-huggingface-pipelines | 💥 Use Hugging Face text and token classification pipelines directly in spaCy | 63 | Python |
| explosion/spacy-ray | ☄️ Parallel and distributed training with spaCy and Ray | 56 | Python |
| explosion/ml-datasets | 🌊 Machine learning dataset loaders for testing and example scripts | 47 | Python |
| explosion/assets | 💥 Explosion Assets | 45 | - |
| explosion/spacy-huggingface-hub | 🤗 Push your spaCy pipelines to the Hugging Face Hub | 45 | Python |
| explosion/prodigy-pdf | A Prodigy plugin for PDF annotation | 37 | Python |
| explosion/wikid | Generate a SQLite database from Wikipedia & Wikidata dumps. | 36 | Python |
| explosion/spacy-alignments | 💫 A spaCy package for Yohei Tamura's Rust tokenizations library | 35 | Python |
| explosion/spacy-curated-transformers | spaCy entry points for Curated Transformers | 32 | Python |
| explosion/spacy-vscode | spaCy extension for Visual Studio Code | 32 | Python |
| explosion/vscode-prodigy | 🧬 A VS Code extension for annotating data with Prodigy | 30 | TypeScript |
| explosion/prodigy-hf | Train Hugging Face models on top of Prodigy annotations | 21 | Python |
| explosion/spacy-benchmarks | 💫 Runtime performance comparison of spaCy against other NLP libraries | 20 | Python |
| explosion/spacy-vectors-builder | 🌸 Train floret vectors | 18 | Python |
| explosion/os-signpost | Wrapper for the macOS signpost API | 16 | Cython |
| explosion/prodigy-evaluate | 🔎 A Prodigy plugin for evaluating spaCy pipelines | 13 | Python |
| explosion/curated-tokenizers | Lightweight piece tokenization library | 12 | Cython |
| explosion/prodigy-segment | Select pixels in Prodigy via Facebook's Segment-Anything model. | 10 | Python |
| explosion/conll-2012 | A slightly cleaned up version of the scripts & data for the CoNLL 2012 Coreference task. | 9 | Python |
| explosion/thinc_gpu_ops | 🔮 GPU kernels for Thinc | 9 | C++ |
| explosion/prodigy-whisper | Audio transcription with OpenAI's Whisper model in the loop. | 5 | Python |
| explosion/prodigy-ann | A Prodigy plugin for ANN techniques | 5 | Python |
| explosion/princetondh | Code for our presentation at Princeton DH, April 2023. | 4 | Jupyter Notebook |
| explosion/spacy-legacy | 🕸️ Legacy architectures and other registered spaCy v3.x functions for backwards-compatibility | 4 | Python |
| explosion/prodigy-lunr | A Prodigy plugin for document search via LUNR | 3 | Python |
| explosion/ec2buildwheel | - | 3 | Python |
| explosion/span-labeling-datasets | Loaders for various span labeling datasets | 2 | Python |
| explosion/.github | :octocat: GitHub settings | 2 | - |
| explosion/spacy-biaffine-parser | - | 1 | Python |
| explosion/aiGrunn-2023 | Materials for the aiGrunn 2023 talk on spaCy Transformer pipelines | 1 | Python |
| explosion/fastapi-explosion-extras | - | 1 | Python |
| explosion/blis | BLAS-like Library Instantiation Software Framework | 1 | C |
| explosion/spacy-io-binder | 📒 Repository used to build Binder images for the interactive spaCy code examples | 1 | Jupyter Notebook |
| explosion/gha-cibuildwheel | - | 0 | - |
| explosion/curated-transformers-addons | Add-ons for Curated Transformers | 0 | Python |
| explosion/nginx_acm_ssl_proxy | Nginx container that allows for environmental variable use to set nginx configuration. | 0 | Shell |