专栏名称: 机器学习研究会
机器学习研究会是北京大学大数据与机器学习创新中心旗下的学生组织,旨在构建一个机器学习从事者交流的平台。除了及时分享领域资讯外,协会还会举办各种业界巨头/学术神牛讲座、学术大牛沙龙分享会、real data 创新竞赛等活动。
目录
相关文章推荐
宝玉xp  ·  回复@smart_tiger请问我今天写论文 ... ·  昨天  
黄建同学  ·  Andrew Brown是Meta ... ·  3 天前  
爱可可-爱生活  ·  今日推介(第1546期):利用自适应温度缩放 ... ·  1 周前  
爱可可-爱生活  ·  【探讨 LLM ... ·  1 周前  
51好读  ›  专栏  ›  机器学习研究会

​ 【推荐】如何选择机器学习算法

机器学习研究会  · 公众号  · AI  · 2017-04-13 19:01

正文



点击上方“机器学习研究会”可以订阅哦


摘要
 

转自:爱可可-爱生活

This resource is designed primarily for beginning data scientists or analysts who are interested in identifying and applying machine learning algorithms to address the problems of their interest.

A typical question asked by a beginner, when facing a wide variety of machine learning algorithms, is “which algorithm should I use?” The answer to the question varies depending on many factors, including:

  • The size, quality, and nature of data.

  • The available computational time.

  • The urgency of the task.

  • What you want to do with the data.

Even an experienced data scientist cannot tell which algorithm will perform the best before trying different algorithms. We are not advocating a one and done approach, but we do hope to provide some guidance on which algorithms to try first depending on some clear factors.


The machine learning algorithm cheat sheet


The machine learning algorithm cheat sheet helps you to choose from a variety of machine learning algorithms to find the appropriate algorithm for your specific problems. This article walks you through the process of how to use the sheet.


Since the cheat sheet is designed for beginner data scientists and analysts, we will make some simplified assumptions when talking about the algorithms.


The algorithms recommended here result from compiled feedback and tips from several data scientists and machine learning experts and developers. There are several issues on which we have not reached an agreement and for these issues we try to highlight the commonality and reconcile the difference.

Additional algorithms will be added in later as our library grows to encompass a more complete set of available methods.


How to use the cheat sheet

Read the path and algorithm labels on the chart as "If  then use ." For example:

  • If you want to perform dimension reduction then use principal component analysis.

  • If you need a numeric prediction quickly, use decision trees or logistic regression.

  • If you need a hierarchical result, use hierarchical clustering.

Sometimes more than one branch will apply, and other times none of them will be a perfect match. It’s important to remember these paths are intended to be rule-of-thumb recommendations, so some of the recommendations are not exact. Several data scientists I talked with said that the only sure way to find the very best algorithm is to try all of them.


链接:

http://blogs.sas.com/content/subconsciousmusings/2017/04/12/machine-learning-algorithm-use/


原文链接:

http://weibo.com/1402400261/EEjnBpy6u?from=page_1005051402400261_profile&wvr=6&mod=weibotime&type=comment#_rnd1492077060567

“完整内容”请点击【阅读原文】
↓↓↓