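Column: 机器学习研究会 (Machine Learning Research Society)
机器学习研究会 is a student organization under the Peking University Center for Big Data and Machine Learning Innovation (北京大学大数据与机器学习创新中心) that aims to build a platform for exchange among machine learning practitioners. Besides sharing news from the field promptly, the society also hosts talks by industry leaders and top academics, salon sessions with leading researchers, real-data innovation competitions, and other activities.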

[Recommended] Kaggle Machine Learning Datasets

机器学习研究会 · WeChat Official Account · AI · 2017-11-19 18:20

Reposted from: 爱可可-爱生活

There are lots of machine-learning-ready datasets on Kaggle's Public Datasets platform to use for fun or practice. Here is a short list of some of our favorites that we've already had the chance to review. They're all (mostly) cleaned and ready for analysis!

Binary Classification

  • Indian Liver Patient Records

  • Synthetic Financial Data for Fraud Detection

  • Business and Industry Reports

  • Can You Predict Product Backorders?

  • Exoplanet Hunting in Deep Space

  • Adult Census Income

Multiclass Classification

  • Iris Species

  • Fall Detection Data from China

  • Biomechanical Features of Orthopedic Patients

Regression

  • Video Game Sales with Ratings

  • NYC Property Sales

  • Gas Sensor Array Under Dynamic Gas Mixtures

NLP

  • The Enron Email Dataset

  • Ubuntu Dialogue Corpus

  • Old Newspapers: A cleaned subset of HC Corpora newspapers

  • Speech Accent Archive

  • Blog Authorship Corpus

Time Series Analysis

  • Cryptocurrency Historical Prices

  • Exoplanet Hunting in Deep Space

Image Processing

  • YouTube Faces with Facial Keypoints

  • Fashion MNIST

Mapping and Prediction

  • Seattle Police Department 911 Incident Response

  • Baltimore 911 Calls

  • Crimes in Chicago

  • Philadelphia Crime Data

  • London Crime

Large Datasets

  • Iowa Liquor Sales

  • Seattle Library Checkout Records


Link:

https://www.kaggle.com/annavictoria/ml-friendly-public-datasets/


Original post:

https://m.weibo.cn/1402400261/4175682756483267
