Starred repositories
An opinionated list of awesome Python frameworks, libraries, software and resources.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal tasks, for both inference and training.
A curated list of awesome Machine Learning frameworks, libraries and software.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials,…
Data Apps & Dashboards for Python. No JavaScript Required.
Best Practices on Recommendation Systems
Typer, build great CLIs. Easy to code. Based on Python type hints.
The interactive graphing library for Python ✨
Jupyter metapackage for installation and documentation
Python package built to ease deep learning on graphs, on top of existing DL frameworks.
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
A unified framework for machine learning with time series
A Python library for the Docker Engine API
macOS development environment setup: Easy-to-understand instructions with automated setup scripts for developer tools like Vim, Sublime Text, Bash, iTerm, Python data analysis, Spark, Hadoop MapRed…
Voilà turns Jupyter notebooks into standalone web applications
A next-generation curated knowledge sharing platform for data scientists and other technical professionals.
Probabilistic time series modeling in Python
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (…
DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.
Generate embeddings from large-scale graph-structured data.