Chapter 1. Introduction -- chapter 2. The mother of invention's triplets : Moore's law, the proliferation of data, and data storage technology -- chapter 3. Hadoop -- chapter 4. HBase and other big data databases -- chapter 5. Machine learning -- chapter 6. Statistics -- chapter 7. Google -- chapter 8. Geographic information systems (GIS) -- chapter 9. Discovery -- chapter 10. Data quality -- chapter 11. Benefits -- chapter 12. Concerns -- chapter 13. Epilogue
0
SUMMARY OR ABSTRACT
Text of Note
With this book, managers and decision makers are given the tools to make more informed decisions about big data purchasing initiatives. Big Data Analytics: A Practical Guide for Managers not only supplies descriptions of common tools, but also surveys the various products and vendors that supply the big data market. Comparing and contrasting the different types of analysis commonly conducted with big data, this accessible reference presents clear-cut explanations of the general workings of big data tools. Instead of spending time on HOW to install specific packages, it focuses on the reasons WHY readers would install a given package. The book provides authoritative guidance on a range of tools, including open source and proprietary systems. It details the strengths and weaknesses of incorporating big data analysis into decision-making and explains how to leverage the strengths while mitigating the weaknesses. Describes the benefits of distributed computing in simple terms; includes substantial vendor/tool material, especially for open source decisions; covers prominent software packages, including Hadoop and Oracle Endeca; Examines GIS and machine learning applications; Considers privacy and surveillance issues. The book further explores basic statistical concepts that, when misapplied, can be the source of errors. Time and again, big data is treated as an oracle that discovers results nobody would have imagined. While big data can serve this valuable function, all too often these results are incorrect yet are still reported unquestioningly. The probability of having erroneous results increases as a larger number of variables are compared unless preventative measures are taken. The approach taken by the authors is to explain these concepts so managers can ask better questions of their analysts and vendors about the appropriateness of the methods used to arrive at a conclusion. Because the world of science and medicine has been grappling with similar issues in the publication of studies, the authors draw on their efforts and apply them to big data. -- from back cover