Meet Libby. Learn more here
We want your feedback!
Click here
Cover Image

Doing Data Science

Straight Talk from the Frontline

by Cathy O'Neil Author · Rachel Schutt Author

ebook

Sign up to save your library

With an OverDrive account, you can save your favorite libraries for at-a-glance information about availability. Find out more about OverDrive accounts.

   Not today

Now that people are aware that data can make the difference in an election or a business model, data science as an occupation is gaining ground. But how can you get started working in a wide-ranging, interdisciplinary field that's so clouded in hype? This insightful book, based on Columbia University's Introduction to Data Science class, tells you what you need to know.

In many of these chapter-long lectures, data scientists from companies such as Google, Microsoft, and eBay share new algorithms, methods, and models by presenting case studies and the code they use. If you're familiar with linear algebra, probability, and statistics, and have programming experience, this book is an ideal introduction to data science.

Topics include:

  • Statistical inference, exploratory data analysis, and the data science process
  • Algorithms
  • Spam filters, Naive Bayes, and data wrangling
  • Logistic regression
  • Financial modeling
  • Recommendation engines and causality
  • Data visualization
  • Social networks and data journalism
  • Data engineering, MapReduce, Pregel, and Hadoop
  • Doing Data Science is collaboration between course instructor Rachel Schutt, Senior VP of Data Science at News Corp, and data science consultant Cathy O'Neil, a senior data scientist at Johnson Research Labs, who attended and blogged about the course.

    Publication Details

    Publisher:
    O'Reilly Media
    Publication Date:
    2013

    Format

    • Kindle Book
    • OverDrive Read 9.7 MB
    • Adobe PDF eBook 24.4 MB
    • Adobe EPUB eBook 9.7 MB
    • Open PDF eBook 24.4 MB
    • Open EPUB eBook 9.6 MB

    Rachel Schutt (Author)

    Cathy O'Neil earned a Ph.D. in math from Harvard, was postdoc at the MIT math department, and a professor at Barnard College where she published a number of research papers in arithmetic algebraic geometry. She then chucked it and switched over to...

    More about Rachel Schutt

    Cathy O'Neil (Author)

    Rachel Schutt is the Senior Vice President for Data Science at News Corp. She earned a PhD in Statistics from Columbia University, and was a statistician at Google Research for several years. She is an adjunct professor in Columbia's Department of...

    More about Cathy O'Neil
    Doing Data Science
    Copy and paste the code into your website.