AWS Interview Questions and Answers: Part 24

Database clients can connect to Redshift using ODBC and JDBC drivers for Postgres.

  • True (Ans)
  • False

Cloudera Impala is included on EMR clusters by default.

  • True
  • False (Ans)

EC2 stands for:

  • External Cloud Connectivity
  • Elastic Compute Cloud (Ans)
  • Energy Conserving CPU
  • Enhanced Capacity, Squared

How does DynamoDB display data rows of differing schema together?

  • It displays common keys as distinct columns in a table, and the rest in an “Other” column
  • It combines all keys from all rows as if they were common to each row (Ans)
  • It omits any key absent from any row
  • It displays data in multiple grids, each with differing column headers
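
The "combine all keys" behavior can be illustrated with a minimal sketch. This is not DynamoDB's console code; the function name `rows_to_grid` and the sample items are hypothetical. It assumes only that each item is a set of attribute name/value pairs and that an attribute missing from an item is shown as an empty cell.

```python
def rows_to_grid(items):
    """Return (columns, grid), where columns is the union of all attribute names."""
    columns = []
    for item in items:
        for key in item:
            # Keep first-seen order so the grid layout is stable.
            if key not in columns:
                columns.append(key)
    # An attribute absent from an item becomes an empty cell.
    grid = [[item.get(col, "") for col in columns] for item in items]
    return columns, grid


# Hypothetical items with differing attributes.
items = [
    {"id": 1, "name": "Ann"},
    {"id": 2, "email": "bob@example.com"},
]
columns, grid = rows_to_grid(items)
for row in [columns] + grid:
    print(row)
```

Every item gets a cell for every column, even though neither item defines all three attributes.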

Which of the following services is a component of the AWS Big Data Stack?

  • Redshift (Ans)
  • VPC
  • CloudWatch
  • Glacier

Amazon offers a code library that allows developers to build their own Kinesis data connectors.

  • False
  • True (Ans)

Designing Jaspersoft reports is done in a standalone desktop application.

  • True
  • False (Ans)

Which of the following is not directly supported as a Pipeline data source or destination?

  • Microsoft SQL Server on Relational Database Service (RDS) (Ans)
  • DynamoDB
  • Redshift
  • MySQL on Relational Database Service (RDS)

Which of the following Hadoop vendors' distributions is supported on Elastic MapReduce?

  • Cloudera
  • Hortonworks
  • MapR (Ans)
  • Pivotal

Which of the following Hadoop distribution components does Elastic MapReduce omit?

  • MapReduce
  • Sqoop (Ans)
  • Hive
  • Pig

What is the name of Apache Pig’s programming language?

  • Pig Latin (Ans)
  • PQL
  • PigML
  • Pig.js

AWS CloudFormation can be used to provision Jaspersoft instances.

  • False
  • True (Ans)

You can create a key pair in the AWS Management Console.

  • True (Ans)
  • False

Saving a Pipeline causes a validation check to be run on it.

  • False
  • True (Ans)

Files stored in S3 can be referenced by URL.

  • True (Ans)
  • False
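
As a rough illustration of referencing an S3 object by URL, the sketch below builds a virtual-hosted-style address. The helper name `s3_object_url`, the bucket name, and the key are hypothetical, and the default region is an assumption. The URL is only directly retrievable if the object is readable by the requester; private objects normally need a signed request or a presigned URL.

```python
from urllib.parse import quote


def s3_object_url(bucket, key, region="us-east-1"):
    """Build a virtual-hosted-style URL for an S3 object (sketch only)."""
    # Percent-encode the key but keep "/" so the key prefix structure survives.
    return f"https://{bucket}.s3.{region}.amazonaws.com/{quote(key)}"


# Hypothetical bucket and key.
print(s3_object_url("example-bucket", "reports/2014 summary.csv"))
```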

Kinesis has data connectors for all Amazon components except:

  • MySQL Relational Database Service (RDS) (Ans)
  • DynamoDB
  • Redshift
  • EMR

The purpose of DynamoDB secondary indexes is to:

  • Allow for fast searches on attributes beyond hash and range keys (Ans)
  • Allow for querying data using SQL
  • Allow indexes to be added after table creation
  • Make data available on more nodes for distributed access
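
The idea behind a secondary index can be sketched in plain Python. This is a conceptual, in-memory analogy and not the DynamoDB API; `build_index` and the sample items are hypothetical. It assumes a table keyed on `id`, where finding items by another attribute would otherwise require scanning every item.

```python
from collections import defaultdict


def build_index(items, attribute):
    """Group items by a non-key attribute so lookups avoid a full scan."""
    index = defaultdict(list)
    for item in items:
        # Items lacking the attribute are simply left out of the index.
        if attribute in item:
            index[item[attribute]].append(item)
    return index


# Hypothetical items keyed on "id"; "city" is not part of the primary key.
items = [
    {"id": 1, "city": "Oslo"},
    {"id": 2, "city": "Lima"},
    {"id": 3, "city": "Oslo"},
    {"id": 4},
]
by_city = build_index(items, "city")
oslo_ids = [item["id"] for item in by_city["Oslo"]]
print(oslo_ids)
```

The lookup `by_city["Oslo"]` touches only the matching items, which is the benefit the correct answer describes.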

Redshift uses relational database technology.

  • True (Ans)
  • False

Which of the following is an effective method for monitoring running Data Pipeline jobs?

  • Run the pipeline in Eclipse
  • Set breakpoints in the Pipeline script file
  • Set email alerts for activity nodes
  • Ensure no steps on the status page stay in the WAITING_FOR_RUNNER or WAITING_ON_DEPENDENCIES state too long (Ans)

Which of the following is the largest enabler of the Big Data phenomenon?

  • Commercial software
  • Relational databases
  • Significant reduction in storage costs (Ans)
  • Specialized hardware appliances

Which of the following does S3 Browser need to connect to your S3 account?

  • Public and private key pair
  • Access key ID and secret access key (Ans)
  • AWS username and password
  • Bucket name and folder name

Impala provides interactive (non-batch) SQL queries over data in:

  • The Hadoop Distributed File System (HDFS) (Ans)
  • Relational Database Service (RDS)
  • DynamoDB
  • Redshift
