Soonchunhyang University, Department of Computer Engineering
Lee Sang-Jeong
[Understanding Big Data]
- Course objectives
This course covers the basic concepts, principles, and applied techniques of big data
computing. It introduces the fundamentals of big data, then covers big data processing
with the Hadoop Distributed File System and MapReduce on a VirtualBox-based cluster,
and distributed data processing and analysis with Python Spark (PySpark), through
lectures and hands-on practice, followed by a review of application case studies.
· Reference sites
http://hadoop.apache.org/
Apache Hadoop
http://spark.apache.org/
Apache Spark
https://github.com/apache/spark
Spark Git repository
https://research.google.com/
Google Research
https://www.data.go.kr/
Public Data Portal (Korea)
https://grouplens.org/
University of Minnesota GroupLens, provides movie data sets
http://archive.ics.uci.edu/ml
UCI Machine Learning Repository, provides machine learning data sets
https://physionet.org/physiobank/
Provides biosignal and related data
· Evaluation:
Attendance 10%, assignments and presentations 50%, exam 30%