E20-554 - Isilon Design Specialist Exam for Technology Architects

Go back to EMC

Example Questions

We have been engaged by a research hospital to help upgrade their Isilon installation. They currently have 12 previous generation Isilon nodes with 200TB of capacity and run on a 1 Gbps network. They currently have 6 Illumina Hi-Seq Sequencers and an HPC cluster to process data. They would like to expand Isilon to 2PB of active data and 1 PB of archive data. They use a third party data and metadata management service (I RODs) to stage data for analysis. The Isilon cluster is mainly used for analysis work with the HPC cluster. The customer will add two more Illumina Hi-Seq Sequencers in the next 6 months. Once these are added, what is the total amount of data from all the sequencers they can expect to add per month? A hospital is considering using Isilon to store their long-term archive data for their PACS application. They currently have two million 17MB studies. They expect capacity requirements to be 50% compounded growth per year. They plan to use N+2:1 protection, and would like this sized for three years of growth. Which minimum cluster configuration would meet this customer's environment? We have been engaged by a research hospital to help upgrade their Isilon installation. They currently have 12 previous generation Isilon nodes with 200TB of capacity and run on a 1 Gbps network. They currently have 6 Illumina Hi-Seq Sequencers and an HPC cluster to process data. They would like to expand Isilon to 2PB of active data and 1 PB of archive data. They use a third party data and metadata management service (I RODs) to stage data for analysis. The Isilon cluster is mainly used for analysis work with the HPC cluster. Before making any changes, the customer would like a graphical presentation of their existing array performance to present to management. What is the best tool to gather this information? You are working with a healthcare company that is interested in replacing all the backend storage for their PACS system. In general, how should the solution be presented to the customer? A potential customer requires 800 TB of usable capacity to store medical images for their network of health clinics. The IT department has limited staff and currently manages four storage arrays from other vendors. During a meeting with the Director of IT you learn that the company is considering a solution from a competitor of EMC using two of the existing arrays and two new arrays. Which Isilon capabilities would show a better ROI for the customer? You are asked about a potential new customer and you would like to get more information on the data skew of their filer. They have a supported filer. What assessment tool would you use to generate this information? In your second meeting with the customer you ask the following questions about their current unified SAN and NAS platform. "Are you using an application to map users from NFS Exports to SMB shares? Is each site a separate LDAP or AD Domain?" What functionality are you trying to understand in their environment and what is the comparable feature in IsilonOneFS? A media and entertainment company has asked for help in creating a new Video On Demand (VOD) storage system. The storage would be delivered to 5000 subscribers simultaneously. They would like to store 5000 hours of full length, High Definition (HD) movies. What would be the best price-to-performance node(s) for this solution? A potential customer requires 4 PB of usable capacity to store media files. The workloads are a combination of streaming video delivery and long term archive of video assets. There is very little space available in the customer's datacenter, so physical density is very important. The customer is concerned about performance impact and potential data loss when using 3 TB or larger drives. What Isilon capabilities can address the customer's concerns? Your customer has a new business unit. They are gathering web log files to be analyzed. They are using Hadoop to do the analysis of the logs. The IT department is centrally storing the log files on a five node Isilon cluster. They have enabled the Hadoop cluster to access the files directly. The nodes are X400's with two SSD drives in each node and 96GB of RAM. How many DataNodes does the Isilon Cluster have? RA A company currently has numerous file servers providing home directories for its employees. Some departments have their own file servers. All corporate users authenticate using Active Directory. The engineering department has their own untrusted AD domain for their file servers. The company is looking for a consolidated solution that requires minimal changes to the environment, and that creates a single namespace for the home directories. What is the recommended solution that will meet this company's requirements? When designing Isilon solutions, which issue should you pay special attention to? You meet a customer for the first time. They explain that their current environment for NAS does not meet their needs. You want to make sure that Isilon is a good fit for their needs. What would be a good prequalifying question to ask? RA When designing Isilon solutions, what issues should you pay special attention to? Your customer is a major metropolitan newspaper. Your previous discussion centered on their storage requirements, and you asked them some of the Top Ten questions. They responded: "Well, it's not the prettiest or most high-tech solution...we use Secure FTP (SFTP) to upload the stories and images and put them in a workflow that formats them for printing in the newspaper." "What really worries us is the fact that our Storage Array from your competitor is going out of maintenance in 120 days, and they are dropping support for Secure FTP. This isn't even considered in our D/R Plan. Do you have a better solution with Isilon?" Which Top Ten questions led to this information? RA You have gathered information from your customer about their current NAS environment. They indicated they are having performance and time-out issues with their clients accessing the storage. Currently over 5000 clients are simultaneously accessing the NAS; however, this will double in the next year. Based on this information, what recommendations for a new Isilon cluster would you give to the customer based on this information? Your customer is moving from a Restaurant/Bar business model to a Restaurant/Casino business model. You are helping architect the upgrade of a current 4-node X200 48 TB Isilon cluster, with 50% utilized for their video surveillance operations. They want to upgrade from 15 FPS (NTSC) in H.264 codec, with a 15 days retention policy to 30 FPS (NTSC) in H.264 codec, with a 30 days retention policy. How many nodes of the same type will you need to add in order to meet the required changes and not exceed 70% raw capacity? Your customer has an Isilon 4-node X400 cluster used for home directory use, and has since updated to OneFS 7.0. The HR and Legal departments have been very strict about using only dedicated file servers that are part of a single, isolated, untrusted Active Directory domain. Their filer has been out of maintenance, and fears are being raised that, due to the age, it may fail at some point. What could the IT department do, to allow the HR and Legal groups to provide file share services, with minimal impact to their other application permissions? RA A healthcare services provider is implementing an X-Series cluster. Currently they have approximately 200 customers and they are growing at 30% per year. They have 200 TB of storage utilized. Health Insurance Portability and Accountability Act (HIPAA) regulations require they keep all patient data for seven (7) years. They would like to keep all genomic sequencing data stored with N+3 protection, not to exceed 75% utilization. What is the minimum initial configuration you would recommend? Your customer is looking for a storage solution that will be able to store seven million three MB files which are written and seldom accessed. Read and write operations are both completed by a web based application, which requires 1.3 Gbps throughput. The customer's network has not been upgraded in many years, so the network interfaces are 1Gbps. Which cluster configuration would best meet the customer's requirements? RA Your customer manages a print media environment, consisting of three Isilon clusters, which are out of support. The customer would like to have access to new software releases and feature sets. You have been asked to perform a full discovery of the customer’s environment. The customer's current Isilon clusters are as follows: “Cust” (12 x 12000X) serves as upload media storage for different tenants. “Working” (8 x 12000X + 6 x X200) serves as a working zone for extraction to RAW and printing media from it. “Archive” (16 x 72NL + Accelerator nodes) is used to store printed content for six months. The customer operates in a Windows environment using SMB 2.0, two DNS servers per AD forest, three forest domains which are servicing three different environments. All servers are operating on a 1Gb network, three VLANs segregating the DEV/QA/PROD environments. Currently, there is no monitoring in place for performance measurement or optimization. The requirements for this solution include: • Better ROI and TCO • Maintain same performance with possible improvements • Renew HW/SW and get inclusive support • Limit migrations • Reduce space, power, cooling consumption • Get new feature sets • If migration required, use Parallel copy (multiple nodes, multiple threads, multiple connections) • Segregate tenant shares from other tenants • Expand up to 1PB of total storage What steps need to be taken on the clusters to meet the customer requirements? RA We have been engaged by a research hospital to help upgrade their Isilon installation. They currently have 12 previous generation Isilon nodes with 200TB of capacity and run on a 1 Gbps network. They currently have 6 Illumina Hi-Seq Sequencers and an HPC cluster to process data. They would like to expand Isilon to 2PB of active data and 1 PB of archive data. They use a third party data and metadata management service (I RODs) to stage data for analysis. The Isilon cluster is mainly used for analysis work with the HPC cluster. Currently, the workflows are being limited by the network access to the Isilon cluster. What would result in the fastest possible network throughput? Which protocols are supported by IsilonOneFS for file access? A potential customer has 540TB Raw capacity of NetApp running in ONTAP 7-Mode. They are willing to let you gather some workload data. Their authentication sources are LDAP and Active Directory. Their users are primarily Windows Clients. They are very interested in Automated Tiering and Large Scale Archives. They are not certain about the workflows from the Marketing Department. The Content Team will only say they need to acquire 300 TB within 90 days. They do not know the average file size or the number of aggregates they are managing. Which tool(s) would you use next to complete your sizing? You are asked to size a cluster for a file sharing environment nfsstat indicates that no more than 10% of the requests are namespace operations (e.g. GETADDR, SETADDR). There will be 10,000 active Linux users, connecting over NFS. Which cluster configuration would you recommend? A customer is inquiring about expanding their primary cluster consisting of six S-Series nodes in order to improve performance. They have a second cluster with 11 X-Series nodes. Both dusters leverage 1 GbE, but the customer recently installed a 10 GbE network that can be utilized. They also have multiple Fibre Channel SAN arrays in their environment. They have a variety of workloads on each cluster and are in the process of determining which workloads belong on which cluster. They have discovered that one of the workloads-a SOL database that resides on the X-Series node cluster - is experiencing timeouts due to latency. The database vendor has suggested limiting latency to 5ms or less to eliminate these timeouts. How do you advise the customer? Your customer is moving from a Restaurant/Bar business model to a Restaurant/Casino business model. You are helping architect the upgrade of a current 4-node X200 48 TB Isilon cluster, with 50% utilized for their video surveillance operations. They want to upgrade from 15 FPS (NTSC) in H.264 codec, with a 15 days retention policy to 30 FPS (NTSC) in H.264 codec with a 30 days retention policy. Which storage capacity would be the best suited to meet the required changes and utilize 65-75% capacity? A customer plans to consolidate 200 TB of digital images and video content. Performance is important for recently created files. However, the customer wants any content that has not been accessed within 30 days to be stored in the cluster at a much lower cost because performance is no longer critical. Which Isilon configuration should be recommended? A telecommunications company has a substantial amount of data. This data is being created by network elements within their environment. The company wants to change the way the network element’s Call Detail Records (CDR) are stored and analyzed. The existing infrastructure consolidates all of the CDRs into a table structure, and then ingests them into a large database. Once ingested, a query engine accesses the database and performs analysis on these files. The system is functional; however, since the amount of CDRs generated will increase exponentially over the next year, the company is open to alternatives for storing and analyzing these records. In evaluating alternatives, the key requirements are to reduce cost, the amount of storage, and the amount of time to analyze the data. The customer would like to use Hadoop to analyze the CDRs. After you have conducted an assessment of the workflow, you have recommended an Isilon Cluster to work within the Hadoop environment. Which protocols would be the best fit when using Isilon for this customer's Hadoop workflow? RA You set up a meeting to gather information on a new project with the IT manager and plan to use a workflow profile assessment (WPA) to document the requirements. Why is it recommended to talk to as many stakeholders as possible? A four-node Isilon cluster is being used in a Hadoop workflow with a parity protection of N+2:1. The HDFS protocol has been enabled and is being used to access data. According to EMC best practices, how many additional copies of data are written into the Isilon cluster using HDFS? An Isilon customer expressed an interest in more effectively managing their storage. Specifically, they would like to plan for future growth. Which tool would allow the customer to forecast capacity requirements? An Isilon customer is reporting less than expected performance from the cluster and poor space utilization. The customer is using three X400 nodes with N+2:1 protection policy to host webserver log data. Upon further analysis, it is discovered that the webservers write log files 64 KiB in size. The log files are then accessed by an analytics application for reporting. What should be done to increase write performance of the cluster? A cost-conscious customer is exploring Isilon for their PACS archive. The workflow consists of one hundred cases a day, each including fifty 60MB images files. However, each image will have five-hundred 64kB metadata files associated with it. They currently have six years worth or archived data. They will need to migrate to the new solution and they need to plan for an additional three years of archive capacity. Which solution would you recommend to fit their capacity needs? Your customer purchased an Isilon cluster with eight X400 nodes for Home Directories. Your follow-up discussions have uncovered an opportunity to expand the cluster to support the Legal Department. As you prepare to complete the Workflow Profile Document, you meet with the customer to discuss data protection. Which key areas should you address? You have had several meetings with an existing EMC VMAX customer. They have agreed that they would like to review an initial Isilon Solution for unstructured data used by the Product Development Team. All production is managed through a Global Enterprise Resource Planning system based in Germany. They require 600TiB of usable capacity. The File System must be accessible from Windows and Linux High-End Workstations. The files are generally not accessed after 120 days, and they would like automated tiering. Which questions about architectural integration will impact the solution design? (RIGHT ANSWER) You are designing a new Isilon system to store mobile device video clips. The files average 800MB in size and the application team tells you to plan for up to 15,000 new files per day. The product manager for this new service wants enough capacity for the first year and reminds you that response time for storing and retrieving the files is important for customer satisfaction. The files will be accessed frequently for the first 48 hours after creation, and then very infrequently after seven days. You plan to use N+2:1 protection, and will configure a SmartPools policy to move inactive files to an archive tier within the cluster. Which configuration best fits the project requirements? Your customer has a new business unit. They are gathering web log files to be analyzed. They are using Hadoop to do the analysis of the logs. The IT department is centrally storing the log files on a 5-node Isilon cluster. They have enabled the Hadoop cluster to access the files directly on the Isilon storage. The nodes are X400's with two SSD drives in each node and 96GB of RAM. How many DataNodes does the Isilon cluster have? A potential customer has about 540TB of NetApp running in ONTAP 7-Mode. They are willing to let you gather some workload data. They have LDAP and Active Directory authentication and primarily Windows Clients. They are very interested in Automated Tiering and Large Scale Archives. They are not certain about the workflows from the Marketing Department. The Content Team will only say they need to acquire 300 TB within 90 days. They do not know the average file size or the number of aggregates they are managing. Which tools would you use to gather information to do your Isilon sizing? A customer wants to add Unix users to their cluster, and requires that NFS access be fault-tolerant. What would you recommend? ( RIGHT ANS ) You are reviewing an opportunity with a trusted advisor. You both discover gaps in the initial solution design. Each iterative discussion with the customer helps you define the solution better. The documents produced capture the customer's requirements. What else needs to occur to refine the sizing considerations? A European Sports TV network is considering Isilon for their Media Edit Storage for their editing workloads. They are also considering Isilon for nearline media archiving. The network receives XDCAM HD footage which is loaded onto their existing Transcoding Storage platform. New footage that needs to be edited will be transferred from their Transcoding Storage platform to the proposed Media Edit Storage platform at the rate of ten simultaneous XDCAM HD files via FTP. Edited files will be transferred back to the Transcoding Storage platform at the rate often simultaneous XDCAM HD files via FTP. The network currently has 15 Final Cut Pro edit stations, 15 Avid edit stations. Proxy software will be used to allow Isilon to act as the storage for the Final Cut Pro and Avid media. Three of those edit stations will be performing content compositing as needed. The network expects 70 hours of new content per week, and 50 hours of edited content per week. They intend to keep the new and edited content on the proposed Media Edit Storage as a performance tier for 30 days. They would like to retain all new and edited footage proposed Media Edit Storage as a near-line tier for two years. In addition to the current workloads, the network expects to implement a new Media Asset Management (MAM) solution and has requested the Isilon cluster be capable of supporting 120 MBps read and 120 MBps write to support the MAM requirements. The customer wants to be able to put completed content into a non-editable folder. However, they want to give access to an administrator to delete content if required. What do you recommend to the customer? A potential customer uses a solution from an EMC competitor for NFS storage in their main data center. The existing arrays are five years old, and the customer would like to consolidate them into a single new system. The entire workload is generated by eight Linux hosts connected to the arrays that process video files. The customer informs you they are not able to collect performance information from the existing arrays. Which tools can capture the workload requirements? A company currently has multiple fileservers to provide home directories for its employees. Each department has a separate fileserver. Most corporate users are on Windows clients and utilize Active Directory, except for the Engineering department, which uses Linux and NIS for authentication. The company is looking for a solution with minimal administrative overhead and a single namespace for the home directories. What is the recommended solution that will meet this company's requirements? You meet with the IT Director of a large single site campus. The IT department has recently taken over Physical Security from the facilities department and plans a complete overhaul of the surveillance solution. They are looking to you as a storage architecture expert to provide the correct amount of storage needed. The IT Director asks you what items they need to provide in order to size this solution. What data do you require from the customer? Your customer manages a print media environment, consisting of three Isilon clusters, which are out of support. The customer would like to have access to new software releases and feature sets. You have been asked to perform a full discovery of the customer’s environment. The customer's current Isilon clusters are as follows: “Cust” (12 x 12000X) serves as upload media storage for different tenants. “Working” (8 x 12000X + 6 x X200) serves as a working zone for extraction to RAW and printing media from it. “Archive” (16 x 72NL + Accelerator nodes) is used to store printed content for six months. The customer operates in a Windows environment using SMB 2.0, two DNS servers per AD forest, three forest domains which are servicing three different environments. All servers are operating on a 1Gb network, three VLANs segregating the DEV/QA/PROD environments. Currently, there is no monitoring in place for performance measurement or optimization. The requirements for this solution include: • Better ROI and TCO • Maintain same performance with possible improvements • Renew HW/SW and get inclusive support • Limit migrations • Reduce space, power, cooling consumption • Get new feature sets • If migration required, use Parallel copy (multiple nodes, multiple threads, multiple connections) • Segregate tenant shares from other tenants • Expand up to 1PB of total storage RA When conducting high-level interviews with stakeholders of a project, what are the key questions that should be asked? What key considerations should be kept in mind when designing a 300TB solution? A customer plans to replace an existing array that is supported by the MiTrend Workload Profile Assessment (WPA) service. You receive performance data from the customer and run a WPA report that shows the array has 50TB of usable capacity. In discussions with the customer, you learn the content is comprised of four million 6MB files and 400 million 64KB files. The customer explains they have a small budget and very limited rack space available in their datacenter. Performance is not a concern because the files are rarely accessed. Which configuration provides the needed usable capacity using N+2:1 protection, and requires the least amount of rack space? RA While validating the value (VTV) with a large educational customer, they have decided they would like to add a Disaster Recovery (DR) site to their existing Isilon environment. The existing cluster contains six X-Series nodes. They plan to have all workflows replicated to the DR site and both clusters will have identical shares, exports, and user authentication. The customer would like to test DR Failover every six months by running their production environment from the DR site. What would you recommend for the DR solution?