专栏名称: 机器学习研究会
机器学习研究会是北京大学大数据与机器学习创新中心旗下的学生组织,旨在构建一个机器学习从事者交流的平台。除了及时分享领域资讯外,协会还会举办各种业界巨头/学术神牛讲座、学术大牛沙龙分享会、real data 创新竞赛等活动。
目录
相关文章推荐
爱可可-爱生活  ·  【Claude和o3 ... ·  2 天前  
爱可可-爱生活  ·  【DeepSeek:比ChatGPT危险10 ... ·  2 天前  
宝玉xp  ·  //@高飞:OpenAI也是神奇,和谷歌的产 ... ·  3 天前  
51好读  ›  专栏  ›  机器学习研究会

【推荐】Kaggle机器学习数据集推荐

机器学习研究会  · 公众号  · AI  · 2017-11-19 18:20

正文



点击上方 “机器学习研究会” 可以订阅
摘要

转自:爱可可-爱生活

There are lots of machine learning ready datasets available to use for fun or practice on Kaggle's Public Datasets platform. Here is a short list of some of our favorites that we've already had the chance to review. They're all (mostly) cleaned and ready for analysis!

Binary Classification

  • Indian Liver Patient Records

  • Synthetic Financial Data for Fraud Detection

  • Business and Industry Reports

  • Can You Predict Product Backorders?

  • Exoplanet Hunting in Deep Space

  • Adult Census Income

Multiclass Classification

  • Iris Species

  • Fall Detection Data from China

  • Biomechanical Features of Orthopedic Patients

Regression

  • Video Game Sales with Ratings

  • NYC Property Sales

  • Gas Sensor Array Under Dynamic Gas Mixtures

NLP

  • The Enron Email Dataset

  • Ubuntu Dialogue Corpus

  • Old Newspapers: A cleaned subset of HC Corpora newspapers

  • Speech Accent Archive

  • Blog Authorship Corpus

Time Series Analysis

  • Cryptocurrency Historical Prices

  • Exoplanet Hunting in Deep Space

Image Processing

  • YouTube Faces with Facial Keypoints







请到「今天看啥」查看全文