Machine learning with Apache SystemML Quiz Answers

Get Machine learning with Apache SystemML Quiz Answers

Apache SystemML is a declarative style language designed for large-scale machine learning. It provides automatic generation of optimized runtime plans ranging from single-node, to in-memory, to distributed computations on Apache Hadoop and Apache Spark. SystemML algorithms are expressed in R-like or Python-like syntax that includes linear algebra primitives, statistical functions and ML-specific constructs. 

As a data scientist, engineer, or just a fellow interested in machine learning, your productivity will increase while having the flexibility to express custom analytics and not worry about the underlying optimization engine. Automatic scalability and optimization is handled by SystemML. This course will not only provide you with a view of how the optimizers function but also provide hands-on examples of ML algorithms and how to run them.

Enroll on Cognitive Class

Module 1 – What is SystemML?

Question: In machine learning, as analytical models are exposed to new data, they are able to independently adapt. True or false?

  • True
  • False

Question: Which of the following are types of alternatives to SystemML?

  • R
  • MLlib
  • Spark R
  • Mahout
  • All of the above

Question: The R language was designed for machine learning and works great for big data. True or false?

  • True
  • False

Module 2 – SystemML and the Spark MLContext

Question: What the ways you can use SystemML’s Spark MLContext?

  • spark-shell
  • Through an application using the API
  • Through the SystemML console
  • A notebook interface
  • None of the above

Question: You must pass in the reference of the SparkContext to the MLContext constructor. True or false?

  • True
  • False

Question: Why would you use the Spark MLContext?

  • Programmatic interface into SystemML’s libraries
  • To benefit from the optimizations that come with SystemML
  • When you need to convert the data to a binary block matrix
  • A and B only
  • None of the above

Module 3 – SystemML algorithms

Question: The Classification algorithm of ensemble learning method that creates a model composed of a set of tree models for classification. True or false?

  • True
  • False

Question: K-means is an unsupervised learning algorithm used to assign a category label to each record so that each similar record tend to get the same label. True or false.

  • True
  • False

Question: The Kaplan-Meier algorithm predicts how likely it is someone will purchase a product of similar category. True or false?

  • True
  • False

Module 4 – Declarative Machine Learning (DML)

Question: What does DML stand for?

  • Data manipulation language
  • Data machine language
  • Declarative machine learning
  • Declarative machine language

Question: To run a DML script, which of the following jar file is required at runtime?

  • MLContext.jar
  • DML.jar
  • SystemML.jar
  • spark-context.jar

Question: Which of the following way to pass command-line arguments is recommended?

  • positional arguments
  • named arguments
  • a comma separated list
  • a file

Module 5 – SystemML architecture and optimization

Question: In the ALS performance comparison, at which dataset does the MLlib code run out of memory??

  • Large
  • Medium
  • Small
  • None

Question: Which of the following does NOT belong to the SystemML Optimizer stack?

  • Create the RDDs for the high level algorithm
  • Compute memory estimates
  • Generate runtime program
  • Live variable analysis

Question: How does SystemML know it is better to run the code on one machine?

  • Advanced Rewrites
  • Propagation of statistics
  • Live variable analysis
  • Efficient runtime
  • The developer tells it to

Final Exam

Question: What is machine learning?

  • Artificial intelligence for machines to make decisions
  • Same as data science to gather insight using machines
  • Enable computers to learn without being explicitly programmed
  • Learning about how machines operate

Question: What is the purpose of SystemML?

  • Programming language for big data
  • In-memory analytics engine
  • Machine learning for spark
  • Machine learning on hadoop
  • All of the above

Question: What are the challenges of machine learning on big data using R?

  • Programmers are needed to convert the high level code to low level code for parallel computing
  • Each iteration of the code takes time to be rewritten and recompile
  • Chances for errors are higher during the translation of the algorithms
  • All of the above

Question: What is the vision of SystemML?

  • Run the same algorithm developed for small data on big data
  • Provide flexible algorithm of ML algorithms
  • Automatic generation of hybrid runtime plans
  • All of the above

Question: Which of the following languages is SystemML most similar?

  • R
  • Python
  • Java
  • Scala
  • Perl
  • R and Python
  • Java and Scala

Question: Which of the following line of code will launch the Spark shell with SystemML?

  • ./bin/spark-shell –jars SystemML.jar
  • ./bin/spark-shell –executor-memory 4G –jars SystemML.jar
  • ./bin/spark-shell –driver-memory 4G –jars SystemML.jar
  • ./bin/spark-shell –executor-memory 4G –driver-memory 4G –jars SystemML.jar
  • All of the above

Question: Why would you convert a DataFrame to a binary-block matrix?

  • To enable parallelization within the Spark engine
  • To use the rich set of APIs provided by the binary-block matrix
  • Allows algorithm performance to be measured separately from data conversion time
  • Allows a more efficient runtime processing of the data

Question: Which of the following is TRUE with regards to helper methods in SystemML?

  • SystemML’s output is encapsulated in the MLContext object
  • SystemML’s output is encapsulated in the MLOutput object
  • Helper methods retrieves the values from the MLOutput object
  • Helper methods retrieves the values from the MLContext object
  • A and D only
  • B and C only

Question: Which is NOT a benefit of using SystemML algorithms?

  • Run in parallel
  • It is faster than all other algorithms
  • No need for translation into a lower level language
  • Algorithms are optimized based on data and cluster characteristics

Question: Which of the following classes of algorithms provide a recommendation?

  • Regression
  • Classification
  • Matrix Factorization
  • Descriptive statistics

Question: Which of the following algorithm can group a set of data into known categories?

  • Regression
  • Clustering
  • Survival Analysis
  • Classification

Question: Which of the following algorithm can be used for prediction, forecasting, or error reduction?

  • Clustering
  • Regression
  • Survival Analysis
  • Descriptive statistics

Question: Which of the following value typesis NOT supported in the DML language?

  • String
  • Double
  • Varchar
  • Boolean

Question: Matrix-vector operations avoids the need for creating replicated matrix for a certain subset of operations. True or false?

  • True
  • False

Question: Global variables cannot be access within a function. True or false?

  • True
  • False

Question: Which of the following are NOT types of categories of built-in functions in DML?

  • Derivative built-in functions
  • Matrix built-in functions
  • Statistical built-in functions
  • Casting built-in functions

Question: In the statistics propagation phase of the SystemML optimizer, what exactly is happening?

  • To determine the confidence level of the computed results
  • All the statistics is propagated to the top node to determine the most efficient runtime for query execution
  • To determine of probability of the operation succeeding within a given period of time
  • Find the widest matrix required and determine if it all fits into the heap.

Question: What is the benefit of doing the matrix rewrite?

  • Reduce the line of code needed to represent the matrix
  • To determine the confidence level of the computed results
  • Clean up and unused memory from the matrix
  • To enable parallelization of the given matrixithin a given period of time
  • Represent the final matrix without computing the intermediate matrices

Question: Which is NOT part of the SystemML runtime for Spark?

  • Automates critical performance decisions
  • Distributed vs. local runtime
  • Efficient linear algebra optimizations
  • Automated RDD caching
  • None of the above

Question: SystemML is an Apache open source project. True or false

  • True
  • False

Conclusion:

We hope you know the correct answers to Machine learning with Apache SystemML If Queslers helped you to find out the correct answers then make sure to bookmark our site for more Course Quiz Answers.

If the options are not the same then make sure to let us know by leaving it in the comments below.

Course Review:

In our experience, we suggest you enroll in this and gain some new skills from Professionals completely free and we assure you will be worth it.

This course is available on Cognitive Class for free, if you are stuck anywhere between quiz or graded assessment quiz, just visit Queslers to get all Quiz Answers and Coding Solutions.

More Courses Quiz Answers >>

Building Cloud Native and Multicloud Applications Quiz Answers

Accelerating Deep Learning with GPUs Quiz Answers

Machine Learning With R Cognitive Class Answers

Machine Learning with Python Cognitive Class Answers

Leave a Reply

Your email address will not be published.