[xgboost] XGBoost 설치 및 데모

데이터분석

[xgboost] XGBoost 설치 및 데모

쌍쌍바나나 2017. 2. 7. 21:32

Tensorflow와 함께 요즘 핫한 xgboost는 ensemble classification을 해주는 플랫폼이다. 기본 베이스로 decision tree를 사용하고 있다. 학습시키는것도 매우 간단하게 몇개의 파라미터만 넣으면 학습이된다. 근데 더 웃긴건 성능이 잘나온다는 거... 요즘 딥러닝도 뜨고 있지만 xgboost도 성능이 좋아서 kaggle같은 competition에서도 많이 사용하니 참고하면 좋을것 같다.

github.com/dmlc/xgboost

xgboost (decision tree)

설치하기

python의 pip로 설치가 가능하다.

설치를 하기 전에 c++ compiler가 필요하기 때문에 설치를 해야 한다.

mac에서는

$ brew install gcc —without-multilib

linux

$ sudo apt-get install build-essential

>> 나 설치하는데 47분 9초 걸림… 완전 장난아니네

이후에 아래와 같이 설치하면 끝,

$ sudo pip install xgboost

Command "python setup.py egg_info" failed with error code 1 in /private/tmp/pip-build-4RVqJZ/xgboost/

물론 업데이트를 하기 위해서는 github에서 build하는게 좋다.

[참고] https://github.com/dmlc/xgboost/tree/master/python-package

데모를 돌려보자

아주 다양하군

XGBoost python feature walkthrough

* Basic walkthrough of wrappers

* Cutomize loss function, and evaluation metric

* Boosting from existing prediction

* Predicting using first n trees

* Generalized Linear Model

* Cross validation

* Predicting leaf indices

* Sklearn Wrapper

* Sklearn Parallel

* Sklearn access evals result

* Access evals result

* External Memory

$ git clone https://github.com/dmlc/xgboost.git

xgboost/demo/guide-python을 확인하면 된다.

[참고] https://github.com/dmlc/xgboost/tree/master/demo/guide-python

저작자표시

'데이터분석' 카테고리의 다른 글

[부동산 데이터 분석] #2.재료만들기 (“11110” 아는사람은 Skip, 모르면 드루와) (0)	2018.12.02
[부동산 데이터 분석] #1.데이터의 출처 어디일까? 공공데이터 포털의 실체 (feat. Postman) 내집마련이 시급하다! (0)	2018.12.02
[Scikit-learn] large data set 학습시키는데 발생하는 문제 (0)	2017.02.07
데이터 용어 정리 (0)	2017.02.07
[데이터 분석] Python 라이브러리 - Pandas, Matplotlib, Numpy 10분만에 배우기 (0)	2016.07.07

현재글[xgboost] XGBoost 설치 및 데모

불로

함께 하는 블로그

RDD, 리눅스, javascript, ubuntu, 설치, git, TensorFlow, dict, Linux, 데이터분석, 파이썬, 자바스크립트, 빅데이터, nodejs, 소스코드, Python, Spark, 안드로이드, Android, 스파크,

Today :
Yesterday :

일	월	화	수	목	금	토
					1	2
3	4	5	6	7	8	9
10	11	12	13	14	15	16
17	18	19	20	21	22	23
24	25	26	27	28	29	30

불로