Hadoop-based systems present unique security challenges. How are massive distributed clusters secured? What are the risks posed by cloud-based systems? Are all the individual technologies used with Hadoop equally secure? And guess what: it better be right, because with big data comes big responsibility.
Hadoop Security is a serious handbook for programmers and IT managers who need to design and execute secure Hadoop clusters without sacrificing performance or ease of use. It starts with hands-on techniques for user security using extensions like Kerberos and OpenSSH. Then, it tackles important techniques like authorization and role based access control, audit logging, monitoring, and secure configurations. Finally, it dives into the all-important area of encryption, exploring key open source projects and a detailed case study.
Along the way, it surveys open source and commercial security tools.