本帖最后由 CathyS 于 2016-1-12 12:38 编辑
课程概述 进入互联网时代,就不得不提三件事:物联网、云计算和大数据。同样,市场的需求地扩张和科技发展的日新月异也使得创业者和公司CEO出身悄然发生了巨大变化。尤其是在信息大量频繁交互的时代,更多的商业决策就必须倚赖数据分析,更甚者,一些能力很强的Data Scientis凭借自身的背景和知识,从单纯的技术人员成长为商业决策者和领导者。
大弓科技Techbow旨在为留美华人提供最前沿技术的拓展,为帮助华人立足硅谷贡献自己的一份力量。与此同时,大弓也推出 大数据之Data Scientist 在线互动课程,无论小伙伴身处何方,都可以加入我们的行列。
免费公开课:1/16(周六)上午9点-11点 (PST), 11点后为收费课程 正式开课:1/16(周六)开始,每周六上午9:00~12:00,共8次课程
老虎老师背景介绍 Duke大学博士,Paypal Data Scientist Data scientist 8 weeks training Syllabus
课程安排 Python eco-system for data analytics 1. What is data scientist 2. Numpy, scipy, pandas, ipython book 3. Basic analytics using pandas 4. Case study using pandas
Data Exploration & Inferential statistics 1. data processing: sampling and transforming 2. Visualizaiton tools 3. Exploratory analysis 4. Estiamtion 5. Hypothesis testing 6. Correlation
Machine learning: classification 1. Naïve Bayes 2. Decision Trees 3. Logistic Regression 4. Neural Network 5. SVM
Machine learning: regression, clustering, PCA 1. linear regression 2. Nonlinear regression 3. clustering algorithms 4. princial component analysis
Data Analysis using sql, hadoop, and Pig 1. principles and system component 2. Pig Design patterns for streaming data 3. MapReduce pattern 4. Case study, implement data ingestion using Pig
Data processing and ML in Sklearn 1. loading and analyzing data 2. Learning and predicting 3. Model selection and parameter tuning 4. Case study, build models in sklearn
Complete case study Application of machine learning techniques in classification and prediction ML in Spark 1. Spark’s principles and components 2. implementation of real time streaming data analytics by Spark 3. Machine learning 4. Case study
|