All posts by dzhamzic

bits & notes

Pandas DataFrame Subsampling in Python

Written long time ago to feed some ML algorithms with data subsets because the original data set was to huge and the algorithm execution performance was too long. Have fun with the script… Advertisements

Feature Scaling in Python and Pandas DataFrame

Hire is a small script that i wrote long time ago to scale some of the features in order to get better performance and better prediction results in some ML algorithms. I used Python with Pandas to read in the CSV file and process feature values. Formulas for feature scaling used in the script can be found ...


How to crawl Hotel Informations from booking using Python and Selenium

This post continues on the last one. Assuming you have the hotel list with urls from booking you can now extract addresses for each hotel. This address can be further converted to latitude and longitude since geo information that can be crawled from booking is not quite right. The script below extracts the address and hotel star ...


How to crawl hotel names and urls from using Python and Selenium

You might be needing a list of all hotels in your city for any reason. Most of them can be found at (assuming it’s a city in Europe). If you need hotel names, ratings and/or hotel url list from any city you can crawl booking for it. Coding it with Python and selenium is pretty easy. Below ...