본문 바로가기

Kaggle4

New York City Taxi Trip Duration 코드리뷰 Competition : https://www.kaggle.com/c/nyc-taxi-trip-duration/overview Code : https://www.kaggle.com/gaborfodor/from-eda-to-the-top-lb-0-367 Description Data Field id - a unique identifier for each trip vendor_id - a code indicating the provider associated with the trip record pickup_datetime - date and time when the meter was engaged dropoff_datetime - date and time when the meter was disengage.. 2022. 2. 1.
House Prices - Advanced Regression Techniques 코드리뷰 2 Competition : https://www.kaggle.com/c/house-prices-advanced-regression-techniques/overview Code : https://www.kaggle.com/serigne/stacked-regressions-top-4-on-leaderboard Description Feature Engineering Process 데이터를 순차적으로 진행하여 결측치 대입 범주형으로 보이는 일부 수치형 변수 변환 order 정보를 가지고 있는 일부 카테고리형 변수 Label Encoding Skewed된 변수에 대한 Box Cox Transformation : 리더보드와 cross-validation에서 모두 약간 더 나은 결과를 제공 범주형 변수에 대한 더미변.. 2022. 1. 26.
Bike Sharing Demand 코드리뷰 Competition : https://www.kaggle.com/c/bike-sharing-demand/overview Code : https://www.kaggle.com/viveksrinivasan/eda-ensemble-model-top-10-percentile Description 1. About Dataset datetime : hourly date + timestamp season : 1 = spring, 2 = summer, 3 = fall, 4 = winter holiday : whether the day is considered a holiday workingday : whether the day is neither a weekend nor holiday weather : 1: Clea.. 2022. 1. 25.
House Prices - Advanced Regression Techniques 코드리뷰 Competition : https://www.kaggle.com/c/house-prices-advanced-regression-techniques/overview Code : https://www.kaggle.com/pmarcelino/comprehensive-data-exploration-with-python Description Dataset train.csv - the training set test.csv - the test set data_description.txt - 각 컬럼에 대한 자세한 설명 sample_submission.csv - a benchmark submission from a linear regression on year and month of sale, lot square .. 2022. 1. 17.