C2090-421 - InfoSphere DataStage v8.5 Exam
Go back to IBM
Configuring the weighting column of an Aggregator stage affects which three options. (Choose three.)
Sum of Weights
Percent Coefficient of Variation
You have a 3TB dataset hash-partitioned on CustID in a clustered environment. You need to join this dataset with 1GB of reference data on OrderID. Which technique is most appropriate?
Use Lookup stage,select auto partitioning for the stream link and entire partitioning for the reference link.
You are about to begin major changes to jobs in a project. You want to conveniently identify job changes on an ad hoc basis. What two tasks will allow you to identify changes to your jobs? (Choose two.)
Select the job,then right click Compare within.
Before making a change to a job make a copy of the job in a different category folder.
There is a requirement to transfer a large file using an FTP Enterprise stage. How can you minimize processing time when a transfer failure occurs?
Manually split the file into multiple files and specify restartable mode on a transfer.
A job design consists of an input SequentialFile stage, followed by a Peek stage, a Transformer stage, a Sort stage and finally an output Dataset stage. The job executes on an SMP system with a configuration file defined with 4 nodes. The environment variable $APT_DISABLE_COMBINATION is set to "True". How many processes will this job create?
You are responsible for managing job changes for you project team. You must update the source code repository through Eclipse Team source code integration whenever a developer notifies you of an asset change. What two tasks allow you to update the source code repository with the changed asset? (Select two)
Right-click the asset and click the Build command
Right-click the asset and click the Team command to commit the artifact to the source control system.
The purchase history record contains CustID, ProductID, ProductType and TotalAmount. You need to retain the record of greatest TotalAmount per CustID and ProductType using RemoveDuplicate stage. Which two statements accomplish this requirement? (Choose two.)
Hash-partition on CustID;Sort on CustID,ProductType and TotalAmount.
Hash-partition on CustID and ProductType;Sort on CustID,ProductType and TotalAmount.
How are Host Systems added to the Shared Metadata repository? (Select two)
Select Repository > Metadata Sharing > Management from the main menu to open the Shared Metadata Management tool
When you configure a domain for source code integration a source control workspace is created. What are two reasons for the source control workspace? (Choose two.)
local transfer area for assets being submitted to the source control system
provides a place to store the archive created by IS source code control integration
Which three of the following actions are available when editing a message handler? (Select three)
Promote to warning
Suppress from job log
Demote to informational
In which two situations would you use the Web Services Client stage? (Choose two.)
You do not need both input and output links in a single web service operation.
You need the Web service to act as either a data source or a data target during an operation.
Which condition is required to use the LastRowInGroup() function to determine a key break on a field?
The input data has to be sorted by the key field and DataStage must be able to detect that the data is sorted,for example,by the presence of a Sort stage.
What are the three features of Information Server that allow you to deploy DataStage objects? (Select three)
Information Server Manager
Which two statements are true about DataStage parallel routines? (Choose two.)
Parallel routines can be written that are called before or after a stage runs.
Parallel routines can be written that are called from within the Transformer stage.
Your job uses the MQ connector stage to read messages from an MQ queue. The job should retrieve the message ID into the MessageID field and parse the payload into two fields: Name is to get the first ten characters, Description is to get the remaining characters. What will accomplish this?
First column is MessageID; select the WSMQ.MSGID data element for the Message ID field; column is Description as VarBinary 200; third column is Name as Binary 10.
First column is MessageID as Binary 24; second column is Name as Binary 10; select WSMG. MSPAYLOAD data element; third column is Description as VarBinary 200; select WSMG.MSPAYLOAD data element.
You have a parallel shared container that is used by other parallel jobs within your project. Part of the logic in this shared container has been changed. Which two statements are true regarding this change to the parallel shared container? (Choose two.)
Jobs using this parallel shared container need to be re-compiled.
Use usage analysis and multi-compile to re-compile job using this shared container.
Input rows to a Transformer contain a product name field and a field with a list of colors the product can be ordered with. The colors are separated by the pipe character (|). Here is an example of an input row: "Shirt"| ....| "Red, Blue, Black"|... For each input row, you want to output multiple output rows, one for each color in the list. For the above example input row, three rows are to be output, one per color: "Shirt" ... "Red", "Shirt" ... "Blue", "Shirt" ... "Black". Which three operations will you need to accomplish this? (Choose three.)
Use the Count() function over the ColorList field to determine the number of loop iterations.
Use the @ITERATION variable to determine which color in the ColorList field to extract using the Field function.
Specify the following loop condition: @ITERATION <= n,where n is a stage variable initialized with number of loop iterations.
What is the correct restructure stage to extract data from four input columns (three of which carry vectors) and output them into two columns with one vectors in a subrecord and the non-vector carrying column being passed through the stage?
Column Import Stage
You have a dataset hash-partitioned and sorted on CustID. You need to create a Sequential File sorted on CustID from this dataset. Which technique is the most appropriate?
Use SortMerge collector for the Sequential File input link.
Which of the following describes what happens when Runtime Column Propagation is enabled?
An input column value flows into a target column only if it matches it by name.
A dataset was created earlier with one partition, with the Preserve Partitioning flag set. A subsequent job using a 2-node configuration file reads from that dataset, passing data to a Transformer stage with "Same" partitioning. Which two statements are true? (Select two)
DataStage will repartition the data and run one instance of the Transformer stage.
Setting the Preserve Partitioning flag to "Clear" will generate two instances of the Transformer stage.
Which two statements are correct about XML stages and their usage? (Choose two.)
XML Transformer stage converts an XML document to another XML hierarchical format.
XML Output stage converts relational tables and sequential files to XML hierarchical structures.
Which of the following is not an ODBC connector property?
Which two of the following derivations for an output column named OutCol can be used to generate a unique sequence of integers across multiple partitions?
@PARTITIONNUM + @NUMPARTITIONS
Which two statements are true about DataStage Parallel Build stages? (Choose two.)
On the Logic Post-Loop,you specify code that runs after all input records have been processed.
The Build stage can be programmed to read and write automatically or code can be written to manually read and write rows.
A scenario requires selecting only the most recent transactions for each of 2 million unique customers, from a 20 million row DB2 source table containing order history. Which parallel job design would satisfy this functional requirement?
Using the DB2 API stage,select all rows. Use a Sort stage with customer number and order date sort keys in ascending order,then Remove Duplicates with Last Duplicate to retain.
Which two statements are true regarding access to a MQ queue? (Choose two.)
MQ stage and MQ connector stages support destructive reads.
MQ connector stage is capable of connecting to Queue manager in both Server and Client mode.
You have been asked to delete a shared container from the project by your customer. Before you do this you want to make sure it will not impact other objects in the project. How will you ensure that deleting the shared container will not cause a failure when jobs are recompiled?
Select the shared container,Right-click on the Where used (deep) command.
Which requirement must be met to read from a database in parallel using the ODBC connector?
Set the Enable partitioning property to Yes.
A job is using the Load method in the DB2 and Oracle Enterprise stage for parallel loading into a single table. The job fails at runtime and you see there are new table rows added. What would explain this result? (Select two)
Load operations are parallel.
Commit size is smaller than source data.
Records in a source file must be copied to multiple output streams for further processing. Which two conditions would require the use of a Transformer stage instead of a Copy stage? (Choose two.)
Concatenating data from multiple input columns.
Directing selected output records down one output link rather than another.
You are describing to your customer how to work with packages. There are two steps that must be carried out in order to move the package to a target project. Which two steps must be carried out to move the DataStage objects to the Production system? (Choose two.)
Which three DataStage elements allow you to create Shared MetaData? (Select three)
DataStage table definition
You have a dataset that is range-partitioned and sorted. Which technique is most efficient to create globally sorted data file?
Insert Funnel stage before the final Sequential File stage and select Sequence as Funnel Type.
What two stages are required to place Header and Detail records, which have different formats, into a single format? (Choose two.)
Column Export Stage
What three types of links are supported by the Lookup Stage? (Select three)
A reference link
A single input link
A single output link
A client has multiple servers connected with a high bandwidth switch and has installed DataStage on one of them. There are five files (with the same record layout) that must be retrieved from three other servers using FTP. Which approach will retrieve and process all five files in the minimal amount of time?
Use a single FTP Enterprise stage and specify five URI properties (one for each file). Next,process the records without landing to disk.
Records in a source file must be aggregated for further processing. Which two conditions would require the use of a Transformer stage instead of or in addition to an Aggregator stage? (Choose two.)
The aggregation is a custom formula for your enterprise.
Aggregation results outside of a certain range need to be flagged as abnormal.
Which three environment variables are used for reporting on DataStage jobs? (Select three)
You are given a job requirement that requires calculations based on the latest currency exchange rate. You decide to invoke a Web service to obtain the latest exchange rate. What are two ways that you can perform a Web service invocation in a DataStage job to obtain the latest exchange rate? (Select two)
Use Web Services Transformer stage.
Use Web Services Client stage as a lookup.
You are setting up project defaults. Which three items can be set in DataStage Administrator? (Choose three.)
default for compile options
defaults for environment variables
default for Runtime Column Propagation
Which two statements are true about the usage of scratch disk? (Choose two.)
You can define multiple scratch disk spaces to distribute disk I/O.
The parallel framework uses the disk space specified in the scratch disk setting to buffer virtual data set records.
What are the three aggregation types can be set in the Aggregator stage? (Select three)
In your DB2 database you have column names that use characters # and $. Which two steps should be completed to allow DataStage to convert these characters into internal format and back as necessary? (Select two)
Set environment variable $DS_ENABLE_RESERVED_CHAR_CONVERT to true.
An important top level Job Sequence has been modified. It controls a process flow consisting of sub Job Sequences, Routines and Parallel Jobs. You must set up a new project to test the top level Job Sequence. You are not allowed to move any repository objects that are not impacted by the top level Job Sequence. What two options would identify only the repository objects impacted by the change made to the top level Job Sequence? (Select two)
Repository Advanced Find Where Used
On Export check Include dependencies of
Which three statements are true about stage variables in a Transformer Stage? (Choose three.)
Stage variables can be set to NULL.
Varchar stage variables can be initialized with spaces.
The expression executed for a stage variable can refer to a stage variable which is executed later.
What are two ways to specify a stylesheet in an XML input stage? (Choose two.)
On the output link advanced properties tab,type the stylesheet in the Stylesheet box.
On the output link advanced properties tab,load the content or path of a stylesheet that is stored on the engine tier host.
A customer requires that a single output file generated by a parallel job be created in sort order. Which two job designs would achieve this goal? (Choose two.)
Use a parallel sort and partition the input to the SequentialFile stage using a Sort Merge collector.
Set the "Execution Mode" in the Advanced stage properties tab of the parallel Sort stage to execute sequentially.
What is the lowest CPU cost partitioning method?
When you run a parallel job, any error messages and warnings are written to the job log and can be viewed from the Director client. What two levels of message handlers are there? (Choose two.)