STAT 1291: Data Science

Lecture 1 - What is Data Science?

Sungkyu Jung

Today

Data Science

20th Century Innovation

Engineering and Computer Science played a key role

(https://dataorigami.net/blogs/napkin-folding/17543555-datas-use-in-the-21st-century)

But how about these 20th Century questions?

What is the difference?

Data

But

That’s what statisticians are already doing.

21st century

“I keep saying that the sexy job in the next 10 years will be statisticians,” said Hal Varian, chief economist at Google. “And I’m not kidding.”

Hal Varian says

“The ability to take data - to be able to understand it, to process it, to extract value from it, to visualize it, to communicate it - that’s going to be a hugely important skill in the next decades, not only at the professional level but even at the educational level for elementary school kids, for high school kids, for college kids. Because now we really do have essentially free and ubiquitous data.”

Data Science and Statistics

Typical data science project

More Tweets, More Votes

Is social media a valid indicator of political behavior?

Imagine that you were asked to answer

Exercise

(https://ssrn.com/abstract=2235423)

(http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0079449)

Final question:

How would you reproduce this study?
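
One possible starting point, sketched in Python: compute each district's share of candidate tweets that mention the Republican, then fit a straight line of vote share on tweet share. This is a minimal sketch, not the authors' method. The records, the helper names `tweet_share` and `fit_line`, and the one-variable least-squares model are all assumptions made for illustration; the linked papers should be consulted for the actual data and model specification.

```python
# Hypothetical toy records, made up for illustration (NOT data from the study):
# (district, Republican tweets, Democratic tweets, Republican vote share)
records = [
    ("A", 120, 80, 0.58),
    ("B", 40, 160, 0.35),
    ("C", 100, 100, 0.51),
    ("D", 150, 50, 0.66),
    ("E", 30, 70, 0.40),
]


def tweet_share(rep_tweets, dem_tweets):
    """Fraction of candidate tweets that mention the Republican candidate."""
    return rep_tweets / (rep_tweets + dem_tweets)


def fit_line(xs, ys):
    """Ordinary least squares fit of y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Predictor: tweet share per district; response: vote share per district.
shares = [tweet_share(rep, dem) for _, rep, dem, _ in records]
votes = [vote for _, _, _, vote in records]

intercept, slope = fit_line(shares, votes)
print(f"intercept={intercept:.3f}, slope={slope:.3f}")
```

A positive slope on real data would be consistent with the "more tweets, more votes" claim, but a real reproduction would also need to decide how tweets are collected, how districts are matched to candidates, and which control variables to include.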