This course Provides instruction on the processes and practice of data science, including machine learning and natural languageprocessing. Included are: tools and programming languages (Python, IPython, Mahout, Pig, NumPy, pandas, SciPy, Scikitlearn), the Natural Language Toolkit (NLTK), and Spark MLlib.
Architects, software developers, analysts and data scientists who need to apply data science and machine learning on Hadoop.
Students must have experience with at least one programming or scripting language, knowledge in statistics and/or mathematics, and a basic understanding of big data and Hadoop principles. Students new to Hadoop are encouraged to attend the HDP Overview: Apache Hadoop Essentials course.
Students should be familiar with programming principles and have previous experience in software development. Experience with Linux and a basic understanding of DataFlow tools would be helpful. No prior Hadoop experience required, but is very helpful.
50% Hands-On Labs
How to Register:
- Click the "Purchase" button at the top of the page to initiate your purchase
- After you have completed your purchase and registration, you will be able to select the event that you wish to attend from the classes scheduled below after logging into your account