Really cool visualization based on UCI Machine Learning Repository datasets:
http://kyrandale.com/viz/d3-smartphone-walking.html
Friday, December 26, 2014
Friday, December 12, 2014
Algorithm choosing
We have a lot of algorithms.
Great schema, which working not only for scikit, provide a simple decision process:
Image source
Great schema, which working not only for scikit, provide a simple decision process:
Image source
Friday, November 28, 2014
Helping Santa's Helpers Kaggle Competition
You can help elves in Santa Workshop pack toys in most efficient way and win $20,000.
Only 40 days left!
More about compettition: http://www.kaggle.com/c/helping-santas-helpers
Only 40 days left!
More about compettition: http://www.kaggle.com/c/helping-santas-helpers
Friday, November 21, 2014
Wednesday, November 12, 2014
Plotting multiple graphs on one page
R + ggplot2 is my favourite tools for building plots.
Today I need to have few graphs on one page.
Solution was found.
You can easily build plots, like that:
Today I need to have few graphs on one page.
Solution was found.
You can easily build plots, like that:
Wednesday, November 5, 2014
Coursera: Mining Massive Datasets
Extremely useful course for data scientist - Mining Massive Datasets by by Jure Leskovec, Anand Rajaraman and Jeff Ullman.
MapReduce
Link Analysis -- PageRank
Week 2:
Locality-Sensitive Hashing -- Basics + Applications
Distance Measures
Nearest Neighbors
Frequent Itemsets
Week 3:
Data Stream Mining
Analysis of Large Graphs
Week 4:
Recommender Systems
Dimensionality Reduction
Week 5:
Clustering
Computational Advertising
Week 6:
Support-Vector Machines
Decision Trees
MapReduce Algorithms
Week 7:
More About Link Analysis -- Topic-specific PageRank, Link Spam.
More About Locality-Sensitive Hashing
In addition, you can buy or download for free Mining Massive Datasets book from Mining Massive Datasets web-page .
Course Syllabus
Week 1:MapReduce
Link Analysis -- PageRank
Week 2:
Locality-Sensitive Hashing -- Basics + Applications
Distance Measures
Nearest Neighbors
Frequent Itemsets
Week 3:
Data Stream Mining
Analysis of Large Graphs
Week 4:
Recommender Systems
Dimensionality Reduction
Week 5:
Clustering
Computational Advertising
Week 6:
Support-Vector Machines
Decision Trees
MapReduce Algorithms
Week 7:
More About Link Analysis -- Topic-specific PageRank, Link Spam.
More About Locality-Sensitive Hashing
In addition, you can buy or download for free Mining Massive Datasets book from Mining Massive Datasets web-page .
Friday, October 17, 2014
Updating R on Ubuntu
[Tested on Ubuntu 14.04 and R 3.1.1, old R version 3.0.2]
sudo add-apt-repository ppa:marutter/rrutter
sudo apt-get update
sudo apt-get upgrade
sudo apt-get install r-base r-base-dev
Thursday, October 16, 2014
How to became a data scientists?
Coursera offers two specialization, which can help you:
- Data science by Johns Hopkins University (9 courses )
- Data mining by University of Illinois (5 courses)
Thursday, September 25, 2014
Data Scientists Demand
Here you can see Data Scientist position demand according to job site indeed.com:
Wednesday, September 24, 2014
Data Science and Great Career
Hello, my dear diary.
Today I found awesome presentation about way to success in Data Science.
Today I found awesome presentation about way to success in Data Science.
Monday, September 22, 2014
"Hello world" post
Hello, my dear diary.
Here I am going to describe my daily routine as a data scientist.
Here I am going to describe my daily routine as a data scientist.
Subscribe to:
Posts (Atom)