Skip to main content
It looks like you're using Internet Explorer 11 or older. This website works best with modern browsers such as the latest versions of Chrome, Firefox, Safari, and Edge. If you continue with this browser, you may see unexpected results.
Welcome - Data Science Guide
This guide is designed to help connect Data Science students and faculty with Data Science resources in the Gangwish Library and on the web.
Find computer science articles in Full Text Finder
Full Text Finder
Once in Full Text Finder find the heading "Computer Science" in the list below the searchbox and click on it. The number next to it will be the number of full-text periodicals available under the heading.
Open Educational Resources & Tools
Data Science Ebooks at the Gangwish Library
R for Data Science by If you are an aspiring data scientist who wants to learn data science and numerical programming concepts through hands-on, real-world project examples, this is the book for you. Whether you are brand new to data science or you are a seasoned expert, you will benefit from learning about the structure of data science projects, the steps in the data science pipeline, and the programming examples presented in this book. Since the book is formatted to walk you through the projects with examples and explanations along the way, no prior programming experience is required.
Call Number: Ebooks
Publication Date: 2014-12-30
Scala Data Analysis Cookbook by Navigate the world of data analysis, visualization, and machine learning with over 100 hands-on Scala recipesAbout This Book- Implement Scala in your data analysis using features from Spark, Breeze, and Zeppelin- Scale up your data anlytics infrastructure with practical recipes for Scala machine learning- Recipes for every stage of the data analysis process, from reading and collecting data to distributed analyticsWho This Book Is ForThis book shows data scientists and analysts how to leverage their existing knowledge of Scala for quality and scalable data analysis.What You Will Learn- Familiarize and set up the Breeze and Spark libraries and use data structures- Import data from a host of possible sources and create dataframes from CSV- Clean, validate and transform data using Scala to pre-process numerical and string data- Integrate quintessential machine learning algorithms using Scala stack- Bundle and scale up Spark jobs by deploying them into a variety of cluster managers- Run streaming and graph analytics in Spark to visualize data, enabling exploratory analysisIn DetailThis book will introduce you to the most popular Scala tools, libraries, and frameworks through practical recipes around loading, manipulating, and preparing your data. It will also help you explore and make sense of your data using stunning and insightfulvisualizations, and machine learning toolkits.Starting with introductory recipes on utilizing the Breeze and Spark libraries, get to grips withhow to import data from a host of possible sources and how to pre-process numerical, string, and date data. Next, you'll get an understanding of concepts that will help you visualize data using the Apache Zeppelin and Bokeh bindings in Scala, enabling exploratory data analysis. iscover how to program quintessential machine learning algorithms using Spark ML library. Work through steps to scale your machine learning models and deploy them into a standalone cluster, EC2, YARN, and Mesos. Finally dip into the powerful options presented by Spark Streaming, and machine learning for streaming data, as well as utilizing Spark GraphX.Style and approachThis book contains a rich set of recipes that covers the full spectrum of interesting data analysis tasks and will help you revolutionize your data analysis skills using Scala and Spark.
Call Number: Ebook
Publication Date: 2015-10-30
Spark for Data Science by Analyze your data and delve deep into the world of machine learning with the latest Spark version, 2.0About This Book- Perform data analysis and build predictive models on huge datasets that leverage Apache Spark- Learn to integrate data science algorithms and techniques with the fast and scalable computing features of Spark to address big data challenges- Work through practical examples on real-world problems with sample code snippetsWho This Book Is ForThis book is for anyone who wants to leverage Apache Spark for data science and machine learning. If you are a technologist who wants to expand your knowledge to perform data science operations in Spark, or a data scientist who wants to understand how algorithms are implemented in Spark, or a newbie with minimal development experience who wants to learn about Big Data Analytics, this book is for you!What You Will Learn- Consolidate, clean, and transform your data acquired from various data sources- Perform statistical analysis of data to find hidden insights- Explore graphical techniques to see what your data looks like- Use machine learning techniques to build predictive models- Build scalable data products and solutions- Start programming using the RDD, DataFrame and Dataset APIs- Become an expert by improving your data analytical skillsIn DetailThis is the era of Big Data. The words 'Big Data' implies big innovation and enables a competitive advantage for businesses. Apache Spark was designed to perform Big Data analytics at scale, and so Spark is equipped with the necessary algorithms and supports multiple programming languages.Whether you are a technologist, a data scientist, or a beginner to Big Data analytics, this book will provide you with all the skills necessary to perform statistical data analysis, data visualization, predictive modeling, and build scalable data products or solutions using Python, Scala, and R.With ample case studies and real-world examples, Spark for Data Science will help you ensure the successful execution of your data science projects.Style and approachThis book takes a step-by-step approach to statistical analysis and machine learning, and is explained in a conversational and easy-to-follow style. Each topic is explained sequentially with a focus on the fundamentals as well as the advanced concepts of algorithms and techniques. Real-world examples with sample code snippets are also included.
Call Number: Ebook
Publication Date: 2016-09-30
Strategic Analytics and SAS by Use aggregate data to answer high-level business questions!Data miners, data scientists, analytic managers, and analysts who work in all industries will find the insights in Randy Collica's Strategic Analytics and SAS: Using Aggregate Data to Drive Organizational Initiatives invaluable in their work. This book shows you how to use your existing data at aggregate levels to answer high-level business questions. Written in a detailed, step-by-step format, the multi-industry use cases begin with a high-level question that a C-level executive might ask. Collica then progresses through the steps to perform the analysis, including many tables and screenshots to guide you along the way. He then ends each use case with the solution to the high-level question. Topics covered include logistic analysis, models developed from surveys, survival analysis, confidence intervals, text mining and analysis, visual analytics, hypothesis tests, and size and magnitude of analytic effects. Connect the dots between detailed data on your customers and the high-level business goals of your organization with Strategic Analytics and SAS!
Call Number: Ebook
Publication Date: 2016-09-21
Director of Library Services
Data Science News
Data Science Weekly
A free weekly newsletter featuring curated news, articles and jobs related to Data Science
No Free Hunch
Data Science news from Kaggle.com blog