EMC Advanced Data Science and Big Data Analytics
Details
This 5-day program builds on skills developed in the Data Science and Big Data Analytics course.
The main focus areas are Hadoop (including Pig, Hive, and HBase), Natural Language Processing, Social Network Analysis, Simulation, Random Forests, Multinomial Logistic Regression, and Data Visualization.
Taking an "open," or technology-neutral, approach, this course uses several open-source tools to address big data challenges.
Upon successful completion of this program, participants should be able to:
1. Develop and execute MapReduce functionality
2. Gain familiarity with NoSQL databases and Hadoop Ecosystem tools for analyzing large-scale and unstructured data sets
3. Develop a working knowledge of Natural Language Processing, Social Network Analysis, and Data Visualization concepts
4. Use advanced quantitative methods and apply one of them in a Hadoop environment
5. Apply advanced techniques to real-world datasets in a final lab
The following modules and lessons are designed to support these objectives:
• MapReduce and Hadoop
• Hadoop Ecosystem and NoSQL
• Natural Language Processing
• Social Network Analysis
• Data Science Theory and Methods
• Data Visualization
Intended audience: Data Scientists, Data Analysts, and Computer Scientists.
Prerequisites:
• Completion of the Data Science and Big Data Analytics program
• Proficiency in at least one programming language, such as Java or Python