¼øõÇâ´ëÇб³ ÄÄÇ»ÅÍ°øÇаú ÀÌ»óÁ¤
[ºòµ¥ÀÌÅÍ ÀÌÇØ]
°úÁ¦Á¦Ãâ°Ô½ÃÆÇ
- °ÀǸñÇ¥
ºòµ¥ÀÌÅÍ ÄÄÇ»ÆÃÀÇ ±âº» °³³ä, ¿ø¸®
¹× ÀÀ¿ë ±â¹ýÀ» °ÀÇÇÑ´Ù. ÁÖ¿ä ³»¿ëÀ¸·Î´Â ºòµ¥ÀÌÅÍÀÇ ±âº» °³³ä°ú ÇÏµÓ ºÐ»ê ÆÄÀÏ ½Ã½ºÅÛ°ú
¸Ê¸®µà½º¸¦ ¼Ò°³ÇÑ ÈÄ ½ºÆÄÅ©¸¦ »ç¿ëÇÑ ºÐ»ê µ¥ÀÌÅÍ Ã³¸® ¹× ºÐ¼® ±â¹ý µîÀ» °ÀÇÇÑ´Ù. ¶ÇÇÑ
½ºÆÄÅ©¸¦ »ç¿ëÇÑ ºòµ¥ÀÌÅÍ ÆÄÀÌÇÁ¶óÀΠó¸® ±â¹ý°ú ½ºÆ®¸®¹Ö µ¥ÀÌÅÍ, NoSQL µ¥ÀÌÅͺ£À̽º, ¸Ó½Å ·¯´× ¹× ½Ç½Ã°£ ´ë½Ãº¸µå µîÀ» È°¿ëÇÑ ºòµ¥ÀÌÅÍ Àû¿ë »ç·Ê¸¦ °ÀÇÇÏ°í ½Ç½ÀÇÑ´Ù.
- ±³Àç
Spark: The Definitive Guide, Matei
Zaharia and Bill Chambers, O'Reilly, 2018
·¯´× ½ºÄ®¶ó, Á¦À̽¼ ½º¿ÍÃ÷, ±èÁ¤ÀÎ, °¼º¿ë ¿Å±è, Á¦ÀÌÆà, 2017.
MapR Academy,
Introduction to Big Data
MapR Academy, Developing Hadoop
Applications
Build and Monitor Apache Spark
Applications
Create Data Pipelines Using Apache
Spark
(°ÀdzëÆ® ´Ù¿î·Îµå)
°ÀÇ ³»¿ë
|
Âü°í »çÀÌÆ®
|
Âü°í ÀÚ·á
|
0. °ÀÇ ¼Ò°³
|
|
|
1. ºòµ¥ÀÌÅÍ ÄÄÇ»ÆÃ
¼Ò°³
|
MapR Academy,
Introduction to Big Data
GFS ³í¹®, Bigtable ³í¹®
|
|
2. ¾ÆÆÄÄ¡ ÇÏµÓ ¼Ò°³
|
MapR Academy, Introduction to Big Data
|
|
3-1.
Ŭ·¯½ºÅÍ
½Ç½À ȯ°æ
3-2. ¸®´ª½º ¸í·É °³¿ä
|
Oracle
VirtualBox
Ubuntu
|
|
4. ¸Ê¸®µà½º ¼Ò°³
|
MapR Academy, Developing Hadoop
Applications
MapReduce ³í¹®
|
|
5. ¸Ê¸®µà½º ÀÀ¿ë
±¸Ãà
|
MapR Academy, Developing Hadoop
Applications
|
receipts.txt
|
6. ¾ÆÆÄÄ¡ ½ºÆÄÅ©
¼Ò°³
|
MapR Academy, Apache Spark Essentials
Lesson 1: Introduction to Apache Spark
½ºÆÄÅ© ³í¹®
|
|
7.
½ºÄ®¶ó ÇÁ·Î±×·¡¹Ö ¾ð¾î ¼Ò°³
|
À©µµ¿ì¿ë SBT 1.1.1,
Learning Scala Materials / ½ºÄ®¶ó Çб³
|
|
8. ½ºÆÄÅ© ÇÁ·Î±×·¡¹Ö
±âÃÊ
|
MapR Academy, Apache Spark Essentials
Lesson 2: Load and Inspect Data
|
auctiondata.csv
|
Áß°£½ÃÇè
|
|
|
9.
½ºÆÄÅ© ÀÀ¿ë±¸Ãà
|
MapR Academy, Apache Spark Essentials
Lesson 3: Build a Simple Apache Spark Application
|
|
10. ½ºÆÄÅ© µ¥ÀÌÅÍÇÁ·¹ÀÓ
|
MapR Academy, Apache Spark Essentials
Lesson
5: Work with DataFrames
½ºÆÄÅ© SQL ³í¹®
|
sfpd.csv
|
11.
½ºÆÄÅ©
MLib
|
MapR Academy, Apache Spark Essentials
Lesson
10: Use Apache Spark MLib
|
movies.dat
ratings.dat
users.dat
|
½ºÆÄÅ© ½ºÆ®¸®¹Ö
|
MapR Academy, Apache Spark Essentials
Lesson 8:
Create an Apache Spark Streaming Application
|
|
HBase
µ¥ÀÌÅͺ£À̽º
|
Coreservelets.com Hadoop Tutorial: HBase Part 1,
2, 3
|
|
½ºÆÄÅ© GraphX
|
MapR Academy, Apache Spark Essentials
Lesson
9: Use Apache Spark GraphX
|
|
½ºÆÄÅ© ±¸Á¶È ½ºÆ®¸®¹Ö°ú Ä«ÇÁÄ«
|
Spark Structured Streaming Programming
Guide
Cloudurable Kafka Tutorial
|
|
½Ç½Ã°£ ¿ì¹ö ¸ð´ÏÅ͸µ ¿¹1 - ±â°èÇнÀ
|
End to End Application for Monitoring
Real-Time Uber Data using Apache APIs: Kafka,Spark,Hbase, part 1: Spark
Machine Learning
|
|
½Ç½Ã°£ ¿ì¹ö ¸ð´ÏÅ͸µ ¿¹ 2 - ½Ç½Ã°£ ºÐ¼®
|
End to End Application for Monitoring
Real-Time Uber Data using Apache APIs: Kafka,Spark,Hbase, part 2: Kafka and
Spark Streaming
|
|
½Ç½Ã°£ ¿ì¹ö ¸ð´ÏÅ͸µ ¿¹ 3
- Vert.x¸¦ ÀÌ¿ëÇÑ ½Ç½Ã°£ ´ë½Ãº¸µå
|
End to End Application for Monitoring
Real-Time Uber Data using Apache APIs: Kafka,Spark,Hbase, part3: Real-Time Dashboard
using Vert.x
|
|
½Ç½Ã°£ ¿ì¹ö ¸ð´ÏÅ͸µ ¿¹1 - HBase
|
End to End Application for Monitoring Real-Time Uber Data
Using Apache APIs: Kafka, Spark, HBase, part 4: Spark Streaming,
DataFrames, and HBase
|
|
¡¤
Âü°í»çÀÌÆ®
http://hadoop.apache.org/
¾ÆÆÄÄ¡
ÇϵÓ
http://spark.apache.org/
¾ÆÆÄÄ¡
½ºÆÄÅ©
https://github.com/apache/spark
Git ½ºÆÄÅ© ÀúÀå¼Ò
http://learn.mapr.com/
MapR ¾ÆÄ«µ¥¹Ì
https://databricks.com/
databricks
https://research.google.com/
±¸±Û
¸®¼Ä¡
http://www.scala-lang.org/
½ºÄ®¶ó
Learning Scala Materials
http://twitter.github.io/scala_school/ko/
½ºÄ®¶ó
Çб³
http://vertx.io/
Vert.x
https://www.data.go.kr/
°ø°ø µ¥ÀÌÅÍ Æ÷ÅÐ
https://grouplens.org/ ¹Ì³×¼ÒŸ ´ëÇÐ GroupLens, ¿µÈ µ¥ÀÌÅÍ ¼¼Æ® Á¦°ø
http://archive.ics.uci.edu/ml
UCI Machine Learning Repository, ±â°èÇнÀ µ¥ÀÌÅÍ ¼¼Æ® Á¦°ø
https://physionet.org/physiobank/
»ýü½ÅÈ£
¹× °ü·Ã µ¥ÀÌÅÍ Á¦°ø
https://machinelearningmastery.com/linear_algebra_for_machine_learning/
Basics of Linear Algebra for Machine Learning
¡¤
Æò°¡: Ãâ¼® ¹× °úÁ¦ 50%, ½ÃÇè 50%
|