IBM Storage Solutions for SAS Analytics using IBM Spectrum Scale and IBM Elastic Storage System 3000 Version 1 Release 1

IBM Storage Solutions for SAS Analytics using IBM Spectrum Scale and IBM Elastic Storage System 3000 Version 1 Release 1

Author: Sanjay Sudam

Publisher: IBM Redbooks

Published: 2020-10-06

Total Pages: 26

ISBN-13: 0738459100

DOWNLOAD EBOOK

This IBM® Redpaper® publication is a blueprint for configuration, testing results, and tuning guidelines for running SAS workloads on Red Hat Enterprise Linux that use IBM Spectrum® Scale and IBM Elastic Storage® System (ESS) 3000. IBM lab validation was conducted with the Red Hat Linux nodes running with the SAS simulator scripts that are connected to the IBM Spectrum Scale and IBM ESS 3000. Simultaneous workloads are simulated across multiple x-86 nodes running with Red Hat Linux to determine scalability against the IBM Spectrum Scale clustered file system and ESS 3000 array. This paper outlines the architecture, configuration details, and performance tuning to maximize SAS application performance with the IBM Spectrum Scale 5.0.4.3 and IBM ESS 3000. This document is intended to facilitate the deployment and configuration of the SAS applications that use IBM Spectrum Scale and IBM Elastic Storage System (ESS) 3000. The information in this document is distributed on an "as is" basis without any warranty that is either expressed or implied. Support assistance for the use of this material is limited to situations where IBM Spectrum Scale or IBM ESS 3000 are supported and entitled and where the issues are specific to a blueprint implementation.


Book Synopsis IBM Storage Solutions for SAS Analytics using IBM Spectrum Scale and IBM Elastic Storage System 3000 Version 1 Release 1 by : Sanjay Sudam

Download or read book IBM Storage Solutions for SAS Analytics using IBM Spectrum Scale and IBM Elastic Storage System 3000 Version 1 Release 1 written by Sanjay Sudam and published by IBM Redbooks. This book was released on 2020-10-06 with total page 26 pages. Available in PDF, EPUB and Kindle. Book excerpt: This IBM® Redpaper® publication is a blueprint for configuration, testing results, and tuning guidelines for running SAS workloads on Red Hat Enterprise Linux that use IBM Spectrum® Scale and IBM Elastic Storage® System (ESS) 3000. IBM lab validation was conducted with the Red Hat Linux nodes running with the SAS simulator scripts that are connected to the IBM Spectrum Scale and IBM ESS 3000. Simultaneous workloads are simulated across multiple x-86 nodes running with Red Hat Linux to determine scalability against the IBM Spectrum Scale clustered file system and ESS 3000 array. This paper outlines the architecture, configuration details, and performance tuning to maximize SAS application performance with the IBM Spectrum Scale 5.0.4.3 and IBM ESS 3000. This document is intended to facilitate the deployment and configuration of the SAS applications that use IBM Spectrum Scale and IBM Elastic Storage System (ESS) 3000. The information in this document is distributed on an "as is" basis without any warranty that is either expressed or implied. Support assistance for the use of this material is limited to situations where IBM Spectrum Scale or IBM ESS 3000 are supported and entitled and where the issues are specific to a blueprint implementation.


Implementation Guide for IBM Elastic Storage System 3000

Implementation Guide for IBM Elastic Storage System 3000

Author: Brian Herr

Publisher: IBM Redbooks

Published: 2021-06-28

Total Pages: 84

ISBN-13: 0738458635

DOWNLOAD EBOOK

This IBM® Redbooks publication introduces and describes the IBM Elastic Storage® Server 3000 (ESS 3000) as a scalable, high-performance data and file management solution. The solution is built on proven IBM Spectrum® Scale technology, formerly IBM General Parallel File System (IBM GPFS). IBM Elastic Storage System 3000 is an all-Flash array platform. This storage platform uses NVMe-attached drives in ESS 3000 to provide significant performance improvements as compared to SAS-attached flash drives. This book provides a technical overview of the ESS 3000 solution and helps you to plan the installation of the environment. We also explain the use cases where we believe it fits best. Our goal is to position this book as the starting point document for customers that would use ESS 3000 as part of their IBM Spectrum Scale setups. This book is targeted toward technical professionals (consultants, technical support staff, IT Architects, and IT Specialists) who are responsible for delivering cost-effective storage solutions with ESS 3000.


Book Synopsis Implementation Guide for IBM Elastic Storage System 3000 by : Brian Herr

Download or read book Implementation Guide for IBM Elastic Storage System 3000 written by Brian Herr and published by IBM Redbooks. This book was released on 2021-06-28 with total page 84 pages. Available in PDF, EPUB and Kindle. Book excerpt: This IBM® Redbooks publication introduces and describes the IBM Elastic Storage® Server 3000 (ESS 3000) as a scalable, high-performance data and file management solution. The solution is built on proven IBM Spectrum® Scale technology, formerly IBM General Parallel File System (IBM GPFS). IBM Elastic Storage System 3000 is an all-Flash array platform. This storage platform uses NVMe-attached drives in ESS 3000 to provide significant performance improvements as compared to SAS-attached flash drives. This book provides a technical overview of the ESS 3000 solution and helps you to plan the installation of the environment. We also explain the use cases where we believe it fits best. Our goal is to position this book as the starting point document for customers that would use ESS 3000 as part of their IBM Spectrum Scale setups. This book is targeted toward technical professionals (consultants, technical support staff, IT Architects, and IT Specialists) who are responsible for delivering cost-effective storage solutions with ESS 3000.


Implementation Guide for IBM Elastic Storage System 3000

Implementation Guide for IBM Elastic Storage System 3000

Author: Brian Herr

Publisher:

Published: 2020

Total Pages: 95

ISBN-13:

DOWNLOAD EBOOK

This IBM® Redbooks publication introduces and describes the IBM Elastic Storage® Server 3000 (ESS 3000) as a scalable, high-performance data and file management solution. The solution is built on proven IBM Spectrum® Scale technology, formerly IBM General Parallel File System (IBM GPFS). IBM Elastic Storage System 3000 is an all-Flash array platform. This storage platform uses NVMe-attached drives in ESS 3000 to provide significant performance improvements as compared to SAS-attached flash drives. This book provides a technical overview of the ESS 3000 solution and helps you to plan the installation of the environment. We also explain the use cases where we believe it fits best. Our goal is to position this book as the starting point document for customers that would use ESS 3000 as part of their IBM Spectrum Scale setups. This book is targeted toward technical professionals (consultants, technical support staff, IT Architects, and IT Specialists) who are responsible for delivering cost-effective storage solutions with ESS 3000.


Book Synopsis Implementation Guide for IBM Elastic Storage System 3000 by : Brian Herr

Download or read book Implementation Guide for IBM Elastic Storage System 3000 written by Brian Herr and published by . This book was released on 2020 with total page 95 pages. Available in PDF, EPUB and Kindle. Book excerpt: This IBM® Redbooks publication introduces and describes the IBM Elastic Storage® Server 3000 (ESS 3000) as a scalable, high-performance data and file management solution. The solution is built on proven IBM Spectrum® Scale technology, formerly IBM General Parallel File System (IBM GPFS). IBM Elastic Storage System 3000 is an all-Flash array platform. This storage platform uses NVMe-attached drives in ESS 3000 to provide significant performance improvements as compared to SAS-attached flash drives. This book provides a technical overview of the ESS 3000 solution and helps you to plan the installation of the environment. We also explain the use cases where we believe it fits best. Our goal is to position this book as the starting point document for customers that would use ESS 3000 as part of their IBM Spectrum Scale setups. This book is targeted toward technical professionals (consultants, technical support staff, IT Architects, and IT Specialists) who are responsible for delivering cost-effective storage solutions with ESS 3000.


Implementation Guide for IBM Elastic Storage System 5000

Implementation Guide for IBM Elastic Storage System 5000

Author: Brian Herr

Publisher: IBM Redbooks

Published: 2020-12-08

Total Pages: 130

ISBN-13: 0738459224

DOWNLOAD EBOOK

This IBM® Redbooks® publication introduces and describes the IBM Elastic Storage® Server 5000 (ESS 5000) as a scalable, high-performance data and file management solution. The solution is built on proven IBM Spectrum® Scale technology, formerly IBM General Parallel File System (IBM GPFS). ESS is a modern implementation of software-defined storage, making it easier for you to deploy fast, highly scalable storage for AI and big data. With the lightning-fast NVMe storage technology and industry-leading file management capabilities of IBM Spectrum Scale, the ESS 3000 and ESS 5000 nodes can grow to over YB scalability and can be integrated into a federated global storage system. By consolidating storage requirements from the edge to the core data center — including kubernetes and Red Hat OpenShift — IBM ESS can reduce inefficiency, lower acquisition costs, simplify storage management, eliminate data silos, support multiple demanding workloads, and deliver high performance throughout your organization. This book provides a technical overview of the ESS 5000 solution and helps you to plan the installation of the environment. We also explain the use cases where we believe it fits best. Our goal is to position this book as the starting point document for customers that would use the ESS 5000 as part of their IBM Spectrum Scale setups. This book is targeted toward technical professionals (consultants, technical support staff, IT Architects, and IT Specialists) who are responsible for delivering cost-effective storage solutions with ESS 5000.


Book Synopsis Implementation Guide for IBM Elastic Storage System 5000 by : Brian Herr

Download or read book Implementation Guide for IBM Elastic Storage System 5000 written by Brian Herr and published by IBM Redbooks. This book was released on 2020-12-08 with total page 130 pages. Available in PDF, EPUB and Kindle. Book excerpt: This IBM® Redbooks® publication introduces and describes the IBM Elastic Storage® Server 5000 (ESS 5000) as a scalable, high-performance data and file management solution. The solution is built on proven IBM Spectrum® Scale technology, formerly IBM General Parallel File System (IBM GPFS). ESS is a modern implementation of software-defined storage, making it easier for you to deploy fast, highly scalable storage for AI and big data. With the lightning-fast NVMe storage technology and industry-leading file management capabilities of IBM Spectrum Scale, the ESS 3000 and ESS 5000 nodes can grow to over YB scalability and can be integrated into a federated global storage system. By consolidating storage requirements from the edge to the core data center — including kubernetes and Red Hat OpenShift — IBM ESS can reduce inefficiency, lower acquisition costs, simplify storage management, eliminate data silos, support multiple demanding workloads, and deliver high performance throughout your organization. This book provides a technical overview of the ESS 5000 solution and helps you to plan the installation of the environment. We also explain the use cases where we believe it fits best. Our goal is to position this book as the starting point document for customers that would use the ESS 5000 as part of their IBM Spectrum Scale setups. This book is targeted toward technical professionals (consultants, technical support staff, IT Architects, and IT Specialists) who are responsible for delivering cost-effective storage solutions with ESS 5000.


IBM Storage Solutions for Splunk Enterprise

IBM Storage Solutions for Splunk Enterprise

Author: IBM

Publisher: IBM Redbooks

Published: 2019-12-17

Total Pages: 28

ISBN-13: 0738458252

DOWNLOAD EBOOK

This document is intended to facilitate the deployment of the Splunk Enterprise Solutions using IBM All Flash Array systems for the Hot and Warm tiers, and IBM Elastic Storage System for the Cold and Frozen tiers. This document provides the reference architecture and configuration guidelines for the IBM Storage systems. The information in this document is distributed on an "as is" basis without any warranty that is either expressed or implied. Support assistance for the use of this material is limited to situations where IBM Storage Systems are supported, entitled and where the issues are specific to a blueprint implementation.


Book Synopsis IBM Storage Solutions for Splunk Enterprise by : IBM

Download or read book IBM Storage Solutions for Splunk Enterprise written by IBM and published by IBM Redbooks. This book was released on 2019-12-17 with total page 28 pages. Available in PDF, EPUB and Kindle. Book excerpt: This document is intended to facilitate the deployment of the Splunk Enterprise Solutions using IBM All Flash Array systems for the Hot and Warm tiers, and IBM Elastic Storage System for the Cold and Frozen tiers. This document provides the reference architecture and configuration guidelines for the IBM Storage systems. The information in this document is distributed on an "as is" basis without any warranty that is either expressed or implied. Support assistance for the use of this material is limited to situations where IBM Storage Systems are supported, entitled and where the issues are specific to a blueprint implementation.


IBM Elastic Storage Server Implementation Guide for Version 5.3

IBM Elastic Storage Server Implementation Guide for Version 5.3

Author: Luis Bolinches

Publisher: IBM Redbooks

Published: 2019-02-05

Total Pages: 102

ISBN-13: 0738457418

DOWNLOAD EBOOK

This IBM® RedpaperTM publication introduces and describes the IBM Elastic StorageTM Server as a scalable, high-performance data and file management solution. The solution is built on proven IBM SpectrumTM Scale technology, formerly IBM General Parallel File System (GPFSTM). IBM Elastic Storage Servers can be implemented for a range of diverse requirements, providing reliability, performance, and scalability. This publication helps you to understand the solution and its architecture and helps you to plan the installation and integration of the environment. The following combination of physical and logical components are required: Hardware Operating system Storage Network Applications This paper provides guidelines for several usage and integration scenarios. Typical scenarios include Cluster Export Services (CES) integration, disaster recovery, and multicluster integration. This paper addresses the needs of technical professionals (consultants, technical support staff, IT Architects, and IT Specialists) who must deliver cost-effective cloud services and big data solutions.


Book Synopsis IBM Elastic Storage Server Implementation Guide for Version 5.3 by : Luis Bolinches

Download or read book IBM Elastic Storage Server Implementation Guide for Version 5.3 written by Luis Bolinches and published by IBM Redbooks. This book was released on 2019-02-05 with total page 102 pages. Available in PDF, EPUB and Kindle. Book excerpt: This IBM® RedpaperTM publication introduces and describes the IBM Elastic StorageTM Server as a scalable, high-performance data and file management solution. The solution is built on proven IBM SpectrumTM Scale technology, formerly IBM General Parallel File System (GPFSTM). IBM Elastic Storage Servers can be implemented for a range of diverse requirements, providing reliability, performance, and scalability. This publication helps you to understand the solution and its architecture and helps you to plan the installation and integration of the environment. The following combination of physical and logical components are required: Hardware Operating system Storage Network Applications This paper provides guidelines for several usage and integration scenarios. Typical scenarios include Cluster Export Services (CES) integration, disaster recovery, and multicluster integration. This paper addresses the needs of technical professionals (consultants, technical support staff, IT Architects, and IT Specialists) who must deliver cost-effective cloud services and big data solutions.


IBM Storage Solutions for IBM Cloud Private Blueprint

IBM Storage Solutions for IBM Cloud Private Blueprint

Author: IBM Storage

Publisher: IBM Redbooks

Published: 2019-04-15

Total Pages: 72

ISBN-13: 0738457647

DOWNLOAD EBOOK

IBM Storage Solutions for IBM CloudTM Private delivers a blueprint for multicloud architecture. IBM, delivering solutions to help you win. In this blueprint, learn how to: Combine the benefits of IBM Systems with the performance of IBM Storage solutions so that you can deliver the right services to your clients today. Deliver optimized private cloud services ahead of schedule and under budget with a complete IBM Cloud Private stack. Containerize applications and deliver the SLAs that your team needs to thrive and win. Implement IBM Cloud Private to deploy modern applications like blockchain and AI or modernize what you already have. You now have the capabilities. This edition applies to IBM Storage Solutions for IBM Cloud Private Version 1 Release 5.0.


Book Synopsis IBM Storage Solutions for IBM Cloud Private Blueprint by : IBM Storage

Download or read book IBM Storage Solutions for IBM Cloud Private Blueprint written by IBM Storage and published by IBM Redbooks. This book was released on 2019-04-15 with total page 72 pages. Available in PDF, EPUB and Kindle. Book excerpt: IBM Storage Solutions for IBM CloudTM Private delivers a blueprint for multicloud architecture. IBM, delivering solutions to help you win. In this blueprint, learn how to: Combine the benefits of IBM Systems with the performance of IBM Storage solutions so that you can deliver the right services to your clients today. Deliver optimized private cloud services ahead of schedule and under budget with a complete IBM Cloud Private stack. Containerize applications and deliver the SLAs that your team needs to thrive and win. Implement IBM Cloud Private to deploy modern applications like blockchain and AI or modernize what you already have. You now have the capabilities. This edition applies to IBM Storage Solutions for IBM Cloud Private Version 1 Release 5.0.


IBM Elastic Storage System Introduction Guide

IBM Elastic Storage System Introduction Guide

Author: Stieg Klein

Publisher: IBM Redbooks

Published: 2022-11-21

Total Pages: 116

ISBN-13: 0738460931

DOWNLOAD EBOOK

This IBM® Redpaper Redbookspublication provides an overview of the IBM Elastic Storage® Server (IBM ESS) and IBM Elastic Storage System (also IBM ESS). These scalable, high-performance data and file management solution, are built on IBM Spectrum® Scale technology. Providing reliability, performance, and scalability, IBM ESS can be implemented for a range of diverse requirements. The latest IBM ESS 3500 is the most innovative system that provides investment protection to expand or build a new Global Data Platform and use current storage. The system allows enhanced, non-disruptive upgrades to grow from flash to hybrid or from hard disk drives (HDDs) to hybrid. IBM ESS can scale up or out with two different storage mediums in the environment, and it is ready for technologies like 200 Gb Ethernet or InfiniBand NDR-200 connectivity. This publication helps you to understand the solution and its architecture. It describes ordering the best solution for your environment, planning the installation and integration of the solution into your environment, and correctly maintaining your solution. The solution is created from the following combination of physical and logical components: Hardware Operating system Storage Network Applications Knowledge of the IBM Elastic Storage Server and IBM Elastic Storage System components is key for planning an environment. This paper is targeted toward technical professionals (consultants, technical support staff, IT Architects, and IT specialists) who are responsible for delivering cost-effective cloud services and big data solutions. The content of this paper can help you to uncover insights among client's data so that you can take appropriate actions to optimize business results, product development, and scientific discoveries.


Book Synopsis IBM Elastic Storage System Introduction Guide by : Stieg Klein

Download or read book IBM Elastic Storage System Introduction Guide written by Stieg Klein and published by IBM Redbooks. This book was released on 2022-11-21 with total page 116 pages. Available in PDF, EPUB and Kindle. Book excerpt: This IBM® Redpaper Redbookspublication provides an overview of the IBM Elastic Storage® Server (IBM ESS) and IBM Elastic Storage System (also IBM ESS). These scalable, high-performance data and file management solution, are built on IBM Spectrum® Scale technology. Providing reliability, performance, and scalability, IBM ESS can be implemented for a range of diverse requirements. The latest IBM ESS 3500 is the most innovative system that provides investment protection to expand or build a new Global Data Platform and use current storage. The system allows enhanced, non-disruptive upgrades to grow from flash to hybrid or from hard disk drives (HDDs) to hybrid. IBM ESS can scale up or out with two different storage mediums in the environment, and it is ready for technologies like 200 Gb Ethernet or InfiniBand NDR-200 connectivity. This publication helps you to understand the solution and its architecture. It describes ordering the best solution for your environment, planning the installation and integration of the solution into your environment, and correctly maintaining your solution. The solution is created from the following combination of physical and logical components: Hardware Operating system Storage Network Applications Knowledge of the IBM Elastic Storage Server and IBM Elastic Storage System components is key for planning an environment. This paper is targeted toward technical professionals (consultants, technical support staff, IT Architects, and IT specialists) who are responsible for delivering cost-effective cloud services and big data solutions. The content of this paper can help you to uncover insights among client's data so that you can take appropriate actions to optimize business results, product development, and scientific discoveries.


Monitoring Overview for IBM Spectrum Scale and IBM Elastic Storage Server

Monitoring Overview for IBM Spectrum Scale and IBM Elastic Storage Server

Author: Kedar Karmarkar

Publisher: IBM Redbooks

Published: 2017-07-28

Total Pages: 62

ISBN-13: 0738456306

DOWNLOAD EBOOK

IBM® Spectrum Scale is software-defined storage for high-performance, large-scale workloads. IBM SpectrumTM Scale (formerly IBM General parallel file system or GPFS) is a scalable data and file management solution that provides a global namespace for large data sets along with several enterprise features. IBM Spectrum ScaleTM is used in clustered environments and provides file protocol (POSIX, NFS, and SMB) and object protocol (Swift and S3) access methods. IBM Elastic StorageTM Server (ESS) is a software-defined storage system that is built upon proven IBM Power SystemsTM, IBM Spectrum Scale software, and storage enclosures. ESS allows for capacity scale up or scale out for performance in modular building blocks, which enables sharing for large data sets across workloads with unified storage pool for file, object, and Hadoop workloads. ESS uses erasure coding-based declustered RAID technology that was developed by IBM to rebuild failed disks in few minutes instead of days. IBM ESS and IBM Spectrum Scale are implemented in scalable environments that are running enterprise workloads. ESS and IBM Spectrum Scale are key components of the enterprise infrastructure. With growing expectations of availability on enterprise infrastructures, monitoring IBM Spectrum Scale, ESS health, and performance is an important function for any IT administrator. This IBM RedpaperTM publication provides an overview of key parameters and methods of IBM Spectrum Scale and ESS monitoring. The audience for this document is IT architects, IT administrators, storage administrators, and users who want to learn more about the administration of an IBM Spectrum Scale and ESS system. This document can be used to monitorfor the environments with IBM Spectrum Scale version 4.2.2.X0 or later. The examples in the document are based on IBM Spectrum Scale 4.2.2.X and ESS 5.0.X.X versions.


Book Synopsis Monitoring Overview for IBM Spectrum Scale and IBM Elastic Storage Server by : Kedar Karmarkar

Download or read book Monitoring Overview for IBM Spectrum Scale and IBM Elastic Storage Server written by Kedar Karmarkar and published by IBM Redbooks. This book was released on 2017-07-28 with total page 62 pages. Available in PDF, EPUB and Kindle. Book excerpt: IBM® Spectrum Scale is software-defined storage for high-performance, large-scale workloads. IBM SpectrumTM Scale (formerly IBM General parallel file system or GPFS) is a scalable data and file management solution that provides a global namespace for large data sets along with several enterprise features. IBM Spectrum ScaleTM is used in clustered environments and provides file protocol (POSIX, NFS, and SMB) and object protocol (Swift and S3) access methods. IBM Elastic StorageTM Server (ESS) is a software-defined storage system that is built upon proven IBM Power SystemsTM, IBM Spectrum Scale software, and storage enclosures. ESS allows for capacity scale up or scale out for performance in modular building blocks, which enables sharing for large data sets across workloads with unified storage pool for file, object, and Hadoop workloads. ESS uses erasure coding-based declustered RAID technology that was developed by IBM to rebuild failed disks in few minutes instead of days. IBM ESS and IBM Spectrum Scale are implemented in scalable environments that are running enterprise workloads. ESS and IBM Spectrum Scale are key components of the enterprise infrastructure. With growing expectations of availability on enterprise infrastructures, monitoring IBM Spectrum Scale, ESS health, and performance is an important function for any IT administrator. This IBM RedpaperTM publication provides an overview of key parameters and methods of IBM Spectrum Scale and ESS monitoring. The audience for this document is IT architects, IT administrators, storage administrators, and users who want to learn more about the administration of an IBM Spectrum Scale and ESS system. This document can be used to monitorfor the environments with IBM Spectrum Scale version 4.2.2.X0 or later. The examples in the document are based on IBM Spectrum Scale 4.2.2.X and ESS 5.0.X.X versions.


Hortonworks Data Platform with IBM Spectrum Scale: Reference Guide for Building an Integrated Solution

Hortonworks Data Platform with IBM Spectrum Scale: Reference Guide for Building an Integrated Solution

Author: Sandeep R. Patil

Publisher: IBM Redbooks

Published: 2018-06-26

Total Pages: 30

ISBN-13: 0738456969

DOWNLOAD EBOOK

This IBM® RedpaperTM publication provides guidance on building an enterprise-grade data lake by using IBM SpectrumTM Scale and Hortonworks Data Platform for performing in-place Hadoop or Spark-based analytics. It covers the benefits of the integrated solution, and gives guidance about the types of deployment models and considerations during the implementation of these models. Hortonworks Data Platform (HDP) is a leading Hadoop and Spark distribution. HDP addresses the complete needs of data-at-rest, powers real-time customer applications, and delivers robust analytics that accelerate decision making and innovation. IBM Spectrum ScaleTM is flexible and scalable software-defined file storage for analytics workloads. Enterprises around the globe have deployed IBM Spectrum Scale to form large data lakes and content repositories to perform high-performance computing (HPC) and analytics workloads. It can scale performance and capacity both without bottlenecks.


Book Synopsis Hortonworks Data Platform with IBM Spectrum Scale: Reference Guide for Building an Integrated Solution by : Sandeep R. Patil

Download or read book Hortonworks Data Platform with IBM Spectrum Scale: Reference Guide for Building an Integrated Solution written by Sandeep R. Patil and published by IBM Redbooks. This book was released on 2018-06-26 with total page 30 pages. Available in PDF, EPUB and Kindle. Book excerpt: This IBM® RedpaperTM publication provides guidance on building an enterprise-grade data lake by using IBM SpectrumTM Scale and Hortonworks Data Platform for performing in-place Hadoop or Spark-based analytics. It covers the benefits of the integrated solution, and gives guidance about the types of deployment models and considerations during the implementation of these models. Hortonworks Data Platform (HDP) is a leading Hadoop and Spark distribution. HDP addresses the complete needs of data-at-rest, powers real-time customer applications, and delivers robust analytics that accelerate decision making and innovation. IBM Spectrum ScaleTM is flexible and scalable software-defined file storage for analytics workloads. Enterprises around the globe have deployed IBM Spectrum Scale to form large data lakes and content repositories to perform high-performance computing (HPC) and analytics workloads. It can scale performance and capacity both without bottlenecks.