Build software better, together
Here are 6,555 public repositories matching this topic...
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.
-
Updated
Oct 15, 2023 - Python
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
-
Updated
Dec 5, 2025 - Python
📝 An awesome Data Science repository to learn and apply for real world problems.
-
Updated
Jun 15, 2026
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
-
Updated
Jun 4, 2026
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
-
Updated
Jun 9, 2026 - C++
Topic Modelling for Humans
-
Updated
Nov 1, 2025 - Python
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committed to automating these high-value generic R&D processes through R&D-Agent, which lets AI drive data-driven AI. 🔗https://aka.ms/RD-Agent-Tech-Report
-
Updated
Jun 15, 2026 - Python
人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
-
Updated
Jun 2, 2024
The "Python Machine Learning (1st edition)" book code repository and info resource
-
Updated
Nov 20, 2024 - Jupyter Notebook
A Python library for anomaly detection across tabular, time series, graph, text, and image data. 60+ detectors, benchmark-backed ADEngine orchestration, and an agentic workflow for AI agents.
-
Updated
Jun 5, 2026 - Python
A unified framework for machine learning with time series
-
Updated
Jun 15, 2026 - Python
Anomaly detection related books, papers, videos, and toolboxes. Last update late 2025 for LLM and VLM works!
-
Updated
Mar 2, 2026 - Python
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
-
Updated
Jun 16, 2026 - C++
Machine Learning for Cyber Security
-
Updated
Aug 19, 2024
🏅 Collection of Kaggle Solutions and Ideas 🏅
-
Updated
Jun 6, 2026 - Astro
🍊 📊 💡 Orange: Interactive data analysis
-
Updated
May 25, 2026 - Python
A library of extension and helper modules for Python's data analysis and machine learning libraries.
-
Updated
Jun 12, 2026 - Python
Curated list of Python resources for data science.
-
Updated
Jun 5, 2026
extract text from any document. no muss. no fuss.
-
Updated
May 7, 2026 - HTML
Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.
-
Updated
Jun 7, 2024 - Java
Improve this page
Add a description, image, and links to the data-mining topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the data-mining topic, visit your repo's landing page and select "manage topics."