forked from rasbt/python-machine-learning-book
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
14 changed files
with
44 additions
and
2 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,4 +1,4 @@ | ||
# What are differences in research nature between the two fields: Machine Learning & Data Mining? | ||
# What are differences in research nature between the two fields: Machine Learning & Data Mining? | ||
|
||
In a nutshell, Data Mining is about the discovery of patterns in datasets or "gaining knowledge and insights" from data. Machine Learning is closely related though. We can think of Machine Learning algorithms as one of he work horses of Data Mining; most Data Mining approaches are based on Machine Learning algorithms. Maybe it helps to think of Data Mining as a pipeline of steps and approaches, and the use of a Machine Learning algorithm is one part of this pipeline. | ||
Or in other words, Data Mining is not "just" Machine Learning. E.g., data visualization or summarization is also part of Data Mining. What I was trying to say is that Machine Learning is one part, one set of techniques, that is/are being used in Data Mining. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,40 @@ | ||
# How would your curriculum for a machine learning beginner look like? | ||
If I had to put together a study plan for a beginner, I would probably start with an easy-going intro course such as | ||
|
||
- Andrew Ng's [Machine Learning course on Coursera](https://class.coursera.org/ml-005/lecture) | ||
|
||
data:image/s3,"s3://crabby-images/a937b/a937b8360c87ef505fffc04ea052999a9a3760ff" alt="" | ||
|
||
Next, I would recommend a good intro book on 'Data Mining' (data mining is essentially about extracting knowledge from data, mainly using machine learning algorithms). I can highly recommend the following book written by one of my former professors: | ||
|
||
- P.-N. Tan, M. Steinbach, and V. Kumar. [Introduction to Data Mining](http://www-users.cs.umn.edu/~kumar/dmbook/index.php), (First Edition). Addison-Wesley Longman Publishing Co., Inc., Boston, MA, USA, 2005. | ||
|
||
data:image/s3,"s3://crabby-images/3c4a7/3c4a73a621310edfcf51446e06185d2a0a4eb319" alt="" | ||
|
||
This book will provide you with a great overview of what's currently out there; you will not only learn about different machine learning techniques, but also learn how to "understand" and "handle" and interpret data -- remember; without "good," informative data, a machine learning algorithm is practically worthless. Additionally, you will learn about alternative techniques since machine learning is not always the only and best solution to a problem | ||
|
||
> if all you have is a hammer, everything looks like a nail ... | ||
Now, After completing the Coursera course, you will have a basic understanding of ML and broadened your understanding via the Data Mining book. | ||
I don't want to self-advertise here, but I think my book would be a good follow-up to learn ML in more depth, understand the algorithms, learn about different data processing pipelines and evaluation techniques, best practices, and learn how to put in into action using Python, NumPy, scikit-learn, and Theano so that you can start working on your personal projects. | ||
|
||
data:image/s3,"s3://crabby-images/b5b5f/b5b5f35c59096eb159f393d8c016ac9d7b4634cb" alt="" | ||
|
||
While you work on your individual projects, I would maybe deepen your (statistical learning) knowledge via one of the three below: | ||
|
||
|
||
- T. Hastie, R. Tibshirani, J. Friedman, T. Hastie, J. Friedman, and R. Tibshirani. [The Elements of Statistical Learning](http://statweb.stanford.edu/~tibs/ElemStatLearn/), volume 2. Springer, 2009. | ||
- C. M. Bishop et al. [Pattern recognition and machine learning](http://www.springer.com/us/book/9780387310732), volume 1. springer New York, 2006. | ||
- Duda, Richard O., Peter E. Hart, and David G. Stork. [Pattern classification](http://www.wiley.com/WileyCDA/WileyTitle/productCd-0471056693.html). John Wiley & Sons, 2012. | ||
|
||
data:image/s3,"s3://crabby-images/bbb50/bbb509d1f388c7c7fe4b4f1e6afa71003f1b5e8d" alt="" | ||
|
||
When you are through all of that and still hungry to learn more, I recommend | ||
|
||
- [the Deep Learning book](http://www.iro.umontreal.ca/~bengioy/dlbook/) by Yoshua Bengio, Ian Goodfellow, and Aaron Courville. The release date is set around 2016, but the 613-page manuscript is already available as as of today (online and for free). | ||
|
||
data:image/s3,"s3://crabby-images/6a91a/6a91a17d325a06d287b09f2d64477d5caf66b5c2" alt="" | ||
|
||
- And in-between, if you are looking for a less technical yet very inspirational free-time read, I highly recommend [Pedro Domingo's The Master Algorithm: How the Quest for the Ultimate Learning Machine Will Remake Our World](https://homes.cs.washington.edu/~pedrod/) | ||
|
||
data:image/s3,"s3://crabby-images/7e919/7e9193a267c26a1ba21c623410e05e2e712c16e8" alt="" |
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters