Easter Sale Special 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: exams65

CertNexus Certified Artificial Intelligence Practitioner Question and Answers

CertNexus Certified Artificial Intelligence Practitioner

Last Update May 15, 2024
Total Questions : 90

We are offering FREE AIP-210 CertNexus exam questions. All you do is to just go and sign up. Give your details, prepare AIP-210 free exam questions and then go for complete pool of CertNexus Certified Artificial Intelligence Practitioner test questions that will help you more.

AIP-210 pdf

AIP-210 PDF

$35  $99.99
AIP-210 Engine

AIP-210 Testing Engine

$42  $119.99
AIP-210 PDF + Engine

AIP-210 PDF + Testing Engine

$56  $159.99
Questions 1

What is Word2vec?

Options:

A.  

A bag of words.

B.  

A matrix of how frequently words appear in a group of documents.

C.  

A word embedding method that builds a one-hot encoded matrix from samples and the terms that appear in them.

D.  

A word embedding method that finds characteristics of words in a very large number of documents.

Discussion 0
Questions 2

What is the primary benefit of the Federated Learning approach to machine learning?

Options:

A.  

It does not require a labeled dataset to solve supervised learning problems.

B.  

It protects the privacy of the user's data while providing well-trained models.

C.  

It requires less computation to train the same model using a traditional approach.

D.  

It uses large, centralized data stores to train complex machine learning models.

Discussion 0
Questions 3

Which three security measures could be applied in different ML workflow stages to defend them against malicious activities? (Select three.)

Options:

A.  

Disable logging for model access.

B.  

Launch ML Instances In a virtual private cloud (VPC).

C.  

Monitor model degradation.

D.  

Use data encryption.

E.  

Use max privilege to control access to ML artifacts.

F.  

Use Secrets Manager to protect credentials.

Discussion 0
Questions 4

Which of the following is a privacy-focused law that an AI practitioner should adhere to while designing and adapting an AI system that utilizes personal data?

Options:

A.  

General Data Protection Regulation (GDPR)

B.  

ISO/IEC 27001

C.  

PCIDSS

D.  

Sarbanes Oxley (SOX)

Discussion 0
Questions 5

Below are three tables: Employees, Departments, and Directors.

Employee_Table

Department_Table

Director_Table

ID

Firstname

Lastname

Age

Salary

DeptJD

4566

Joey

Morin

62

$ 122,000

1

1230

Sam

Clarck

43

$ 95,670

2

9077

Lola

Russell

54

$ 165,700

3

1346

Lily

Cotton

46

$ 156,000

4

2088

Beckett

Good

52

$ 165,000

5

Which SQL query provides the Directors' Firstname, Lastname, the name of their departments, and the average employee's salary?

Options:

A.  

SELECT m.Firstname, m.Lastname, d.Name, AVG(e.Saiary) as Dept_avg_Saiary

FROM Employee_Table as e

LEFT JOIN Department_Table as d on e.Dept = d.Name

LEFT JOIN Directorjable as m on d.ID = m.DeptJD

GROUP BY m.Firstname, m.Lastname, d.Name

B.  

SELECT m.Firstname, m.Lastname, d.Name, AVG(e.Salary) as Dept_avg_Salary

FROM Employee_Table as e

RIGHT JOIN Departmentjable as d on e.Dept = d.Name

INNER JOIN Directorjable as m on d.ID = m.DeptJD

GROUP BY d.Name

C.  

SELECT m.Firstname, m.Lastname, d.Name, AVG(e.Salary) as Dept_avg_Salary

FROM Employee_Table as e

RIGHT JOIN Department_Table as d on e.Dept = d.Name

INNER JOIN Directorjable as m on d.ID = m.DeptJD

GROUP BY e.Salary

D.  

SELECT m.Firstname, m.Lastname, d.Name, AVG(e.Salary) as Dept_avg_Salary

FROM Employee_Table as e

RIGHT JOIN Department_Table as d on e.Dept = d.Name

INNER JOIN Directorjable as m on d.ID = m.DeptID

GROUP BY m.Firstname, m.Lastname, d.Name

Discussion 0
Questions 6

Workflow design patterns for the machine learning pipelines:

Options:

A.  

Aim to explain how the machine learning model works.

B.  

Represent a pipeline with directed acyclic graph (DAG).

C.  

Seek to simplify the management of machine learning features.

D.  

Separate inputs from features.

Discussion 0
Questions 7

Which of the following describes a benefit of machine learning for solving business problems?

Options:

A.  

Increasing the quantity of original data

B.  

Increasing the speed of analysis

C.  

Improving the constraint of the problem

D.  

Improving the quality of original data

Discussion 0
Questions 8

The graph is an elbow plot showing the inertia or within-cluster sum of squares on the y-axis and number of clusters (also called K) on the x-axis, denoting the change in inertia as the clusters change using k-means algorithm.

What would be an optimal value of K to ensure a good number of clusters?

Options:

A.  

2

B.  

3

C.  

5

D.  

9

Discussion 0
Questions 9

Word Embedding describes a task in natural language processing (NLP) where:

Options:

A.  

Words are converted into numerical vectors.

B.  

Words are featurized by taking a histogram of letter counts.

C.  

Words are featurized by taking a matrix of bigram counts.

D.  

Words are grouped together into clusters and then represented by word cluster membership.

Discussion 0
Questions 10

When working with textual data and trying to classify text into different languages, which approach to representing features makes the most sense?

Options:

A.  

Bag of words model with TF-IDF

B.  

Bag of bigrams (2 letter pairs)

C.  

Word2Vec algorithm

D.  

Clustering similar words and representing words by group membership

Discussion 0
Questions 11

Which two encodes can be used to transform categories data into numerical features? (Select two.)

Options:

A.  

Count Encoder

B.  

Log Encoder

C.  

Mean Encoder

D.  

Median Encoder

E.  

One-Hot Encoder

Discussion 0
Questions 12

A classifier has been implemented to predict whether or not someone has a specific type of disease. Considering that only 1% of the population in the dataset has this disease, which measures will work the BEST to evaluate this model?

Options:

A.  

Mean squared error

B.  

Precision and accuracy

C.  

Precision and recall

D.  

Recall and explained variance

Discussion 0
Questions 13

We are using the k-nearest neighbors algorithm to classify the new data points. The features are on different scales.

Which method can help us to solve this problem?

Options:

A.  

Log transformation

B.  

Normalization

C.  

Square-root transformation

D.  

Standardization

Discussion 0
Questions 14

Which of the following regressions will help when there is the existence of near-linear relationships among the independent variables (collinearity)?

Options:

A.  

Clustering

B.  

Linear regression

C.  

Polynomial regression

D.  

Ridge regression

Discussion 0
Questions 15

Which of the following items should be included in a handover to the end user to enable them to use and run a trained model on their own system? (Select three.)

Options:

A.  

Information on the folder structure in your local machine

B.  

Intermediate data files

C.  

Link to a GitHub repository of the codebase

D.  

README document

E.  

Sample input and output data files

Discussion 0
Questions 16

Your dependent variable Y is a count, ranging from 0 to infinity. Because Y is approximately log-normally distributed, you decide to log-transform the data prior to performing a linear regression.

What should you do before log-transforming Y?

Options:

A.  

Add 1 to all of the Y values.

B.  

Divide all the Y values by the standard deviation of Y.

C.  

Explore the data for outliers.

D.  

Subtract the mean of Y from all the Y values.

Discussion 0
Questions 17

When should the model be retrained in the ML pipeline?

Options:

A.  

A new monitoring component is added.

B.  

Concept drift is detected in the pipeline.

C.  

More data become available for the training phase.

D.  

Some outliers are detected in live data.

Discussion 0
Questions 18

Which of the following metrics is being captured when performing principal component analysis?

Options:

A.  

Kurtosis

B.  

Missingness

C.  

Skewness

D.  

Variance

Discussion 0
Questions 19

Which of the following methods can be used to rebalance a dataset using the rebalance design pattern?

Options:

A.  

Bagging

B.  

Boosting

C.  

Stacking

D.  

Weighted class

Discussion 0
Questions 20

In general, models that perform their tasks:

Options:

A.  

Less accurately are less robust against adversarial attacks.

B.  

Less accurately are neither more nor less robust against adversarial attacks.

C.  

More accurately are less robust against adversarial attacks.

D.  

More accurately are neither more nor less robust against adversarial attacks.

Discussion 0
Questions 21

Which of the following text vectorization methods is appropriate and correctly defined for an English-to-Spanish translation machine?

Options:

A.  

Using TF-IDF because in translation machines, we do not care about the order of the words.

B.  

Using TF-IDF because in translation machines, we need to consider the order of the words.

C.  

Using Word2vec because in translation machines, we do not care about the order of the words.

D.  

Using Word2vec because in translation machines, we need to consider the order of the words.

Discussion 0
Questions 22

Which of the following approaches is best if a limited portion of your training data is labeled?

Options:

A.  

Dimensionality reduction

B.  

Probabilistic clustering

C.  

Reinforcement learning

D.  

Semi-supervised learning

Discussion 0
Questions 23

Which of the following models are text vectorization methods? (Select two.)

Options:

A.  

Lemmatization

B.  

PCA

C.  

Skip-gram

D.  

TF-IDF

E.  

Tokenization

F.  

t-SNE

Discussion 0
Questions 24

You train a neural network model with two layers, each layer having four nodes, and realize that the model is underfit. Which of the actions below will NOT work to fix this underfitting?

Options:

A.  

Add features to training data

B.  

Get more training data

C.  

Increase the complexity of the model

D.  

Train the model for more epochs

Discussion 0
Questions 25

Which two of the following statements about the beta value in an A/B test are accurate? (Select two.)

Options:

A.  

The Beta value is the rate of type II errors for the test.

B.  

The Beta value is the rate of type I errors for the test.

C.  

The statistical power of a test is the inverse of the Beta value, or 1 - Beta.

D.  

The Beta in an Alpha/Beta test represents one of the two variants of the A/B test.

Discussion 0
Questions 26

Which two of the following decrease technical debt in ML systems? (Select two.)

Options:

A.  

Boundary erosion

B.  

Design anti-patterns

C.  

Documentation readability

D.  

Model complexity

E.  

Refactoring

Discussion 0