Category: Data Science

Linear Regression with one Variable in Matlab
In this post i will show you how to implement one of the basic Machine Learning concepts in Matlab, the Linear Regression with one Variable. Matlab and Octave are very useful high-level languages for prototyping Machine Learning algorithms. [Linear Regression Example from mathworks.com] The idea is to find the line that perfectly fits all ...

Distribute Missing Values randomly across Columns in Python Pandas
Continuing to the previous post this script distributes missing values across all features in Pandas DataFrame. Just set min and max missing value distribution and you’re ready to execute … Enjoy…

Distribute Missing Values in Pandas DataFrame Column with Python
If you want to test how Maschine Learning algorithms perform with missing values you may need a script to distribute a fixed percentage of missing values in a feature. This script randomly distributes missing values in a single data set’s column. Enjoy…

Pandas DataFrame Subsampling in Python
Written long time ago to feed some ML algorithms with data subsets because the original data set was to huge and the algorithm execution performance was too long. Have fun with the script…

Feature Scaling in Python and Pandas DataFrame
Hire is a small script that i wrote long time ago to scale some of the features in order to get better performance and better prediction results in some ML algorithms. I used Python with Pandas to read in the CSV file and process feature values. Formulas for feature scaling used in the script can be found ...

How to crawl hotel names and urls from booking.com using Python and Selenium
You might be needing a list of all hotels in your city for any reason. Most of them can be found at booking.com (assuming it’s a city in Europe). If you need hotel names, ratings and/or hotel url list from any city you can crawl booking for it. Coding it with Python and selenium is pretty easy. Below ...