Labour Day Special 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: exams65

CompTIA Data+ Certification Exam Question and Answers

CompTIA Data+ Certification Exam

Last Update May 1, 2024
Total Questions : 262

We are offering FREE DA0-001 CompTIA exam questions. All you do is to just go and sign up. Give your details, prepare DA0-001 free exam questions and then go for complete pool of CompTIA Data+ Certification Exam test questions that will help you more.

DA0-001 pdf

DA0-001 PDF

$35  $99.99
DA0-001 Engine

DA0-001 Testing Engine

$42  $119.99
DA0-001 PDF + Engine

DA0-001 PDF + Testing Engine

$56  $159.99
Questions 1

Which of the following is the most likely reason for a data analyst to optimize a query using parameterization?

Options:

A.  

To return a subset of records

B.  

To insert a temporary table

C.  

To prevent SQL injections

D.  

To increase the query speed

Discussion 0
Questions 2

Given the information in the following tables:

Which of the following describes merging these tables to create a master file that includes all transactions for both online and in-store sales?

Options:

A.  

Data audit

B.  

Data completeness

C.  

Data validation

D.  

Data consolidation

Discussion 0
Questions 3

An analyst needs to create an analytics dashboard for an employee intranet site to improve the search functionality, display relevant information, and maintain an updated FAQ page. Which of the following visualizations would best represent what employees are searching for?

Options:

A.  

A word cloud

B.  

A histogram

C.  

A pie chart

D.  

A scatter plot

Discussion 0
Questions 4

Amanda needs to create a dashboard that will draw information from many other data sources and present it to business leaders.

Which one of the following tools is least likely to meet her needs?

Options:

A.  

QuickSight.

B.  

Tableau.

C.  

Power BI.

D.  

SPSS Modeler.

Discussion 0
Questions 5

A data analyst has been asked to create a daily manufacturing report for the floor manager Which of the following metrics should be included in the report?

Options:

A.  

Tons of steel produced per hour

B.  

Annual sales budget

C.  

End-of-day stock price

D.  

Daily corporate employee count

Discussion 0
Questions 6

A development company is constructing a new Init in its apartment complex. The complex has the following floor plans:

Using the average cost per square foot of the original floor plans. which of the following should be the price of the Rose Init?

Options:

A.  

$640,900

B.  

$690,000

C.  

$705,200

D.  

$702,500

Discussion 0
Questions 7

Encryption is a mechanism for protecting data.

When should encryption be applied to data?

Choose the best answer.

Options:

A.  

When data is at rest.

B.  

When data is at rest or in transit.

C.  

When data is in transit.

D.  

When data is at rest, unless you are using local storage.

Discussion 0
Questions 8

The current date is July 14, 2020. A data analyst has been asked to create a report that shows the company’s year-over-year Q2 2020 sales. Which of the following reports should the analyst compare?

Options:

A.  

A Q2 2020 and Q4 2019

B.  

YTD 2020 and YTD 2019

C.  

Q2 2020 and Q2 2019

D.  

Q2 2020 and Q2 2021

Discussion 0
Questions 9

A data analyst is asked on the morning of April 9, 2020, to create a sales report that identifies sales year to date. The daily sales data is current through the end of the day. Which of the following date ranges should be on the report?

Options:

A.  

January 1, 2020 to April 1, 2020

B.  

January 1, 2020 to April 7, 2020

C.  

January 1, 2020 to April 8, 2020

D.  

January 1, 2020 to April 9, 2020

Discussion 0
Questions 10

Which of the following is an example of PII?

Options:

A.  

Age

B.  

Name

C.  

Ethnicity

D.  

Gender

Discussion 0
Questions 11

An analyst reviews the following data:

7

3

5

2

3

7

7

10

Which of the following is the value of the mode?

Options:

A.  

3

B.  

5

C.  

7

D.  

10

Discussion 0
Questions 12

A data analyst must fulfill a request for information that is needed weekly and should be automatically emailed to a specific set of users. Which of the following types of reports should the analyst recommend?

Options:

A.  

A self-service report

B.  

A research report

C.  

An ad hoc report

D.  

An operational report

Discussion 0
Questions 13

The process of performing initial investigations on data to spot outliers, discover patterns, and test assumptions with statistical insight and graphical visualization is called:

Options:

A.  

a t-test.

B.  

a performance analysis.

C.  

an exploratory data analysis.

D.  

a link analysis.

Discussion 0
Questions 14

Which of the following is a characteristic of a relational database?

Options:

A.  

It utilizes key-value pairs.

B.  

It has undefined fields.

C.  

It is structured in nature.

D.  

It uses minimal memory.

Discussion 0
Questions 15

Which of the following would be used to store unstructured data from different sources?

Options:

A.  

A data lake

B.  

A database management system

C.  

A database

D.  

A data warehouse

Discussion 0
Questions 16

Which one of the following is a common data warehouse schema?

Options:

A.  

Snowflake.

B.  

Square.

C.  

Spiral.

D.  

Sphere.

Discussion 0
Questions 17

Which of the following would be the best way to identify multicollinear attributes in a data set?

Options:

A.  

Correlation coefficient

B.  

Chi-squared test

C.  

Two-sample f-test

D.  

Two-way ANOVA

Discussion 0
Questions 18

You are working with a dataset and want to change the names of categories that you used for different types of books.

What term best describes this action?

Options:

A.  

Recording.

B.  

Summarizing

C.  

Aggregating.

D.  

Filtering.

Discussion 0
Questions 19

An analyst is currently working on a ticket for revamping a company-wide dashboard that has been in use for five years. Which of the following should be the first step in the development process?

Options:

A.  

Talk to the group that made the request to determine the desired goal.

B.  

Make changes to a frequently used report that is already in production.

C.  

Build an additional dashboard with fewer views that are tailored toward each specific team.

D.  

Develop a more streanMined dashboard to roll out by the next delivery date.

Discussion 0
Questions 20

Which of the following report types is most appropriate for a high-level, year-end report requested by a Chief Executive Officer?

Options:

A.  

Dynamic

B.  

Recurring

C.  

Ad hoc

D.  

Self-service

Discussion 0
Questions 21

A data set was recorded using multimedia technology. Which of the following is a necessary step on the way to interpretation?

Options:

A.  

Structural equation modeling

B.  

Transcription

C.  

Sequential analysis

D.  

Sampling

Discussion 0
Questions 22

An analyst has written the following code:

SELECT *

FROM Cust_table

WHERE age > 60 AND City = "New York"

Which of the following criteria is the analyst retrieving?

Options:

A.  

All customers older than age 60 in New York state

B.  

All customers aged 60 and older in New York state

C.  

All customers older than age 60 in New York City

D.  

All customers younger than age 60 in New York City

Discussion 0
Questions 23

Given the table below:

Which of the following boxes indicates that a Type Il error has occurred?

Options:

A.  

1

B.  

2

C.  

3

D.  

4

Discussion 0
Questions 24

A data analyst needs to present the results of an online marketing campaign to the marketing manager. The manager wants to see the most important KPIs and measure the return on marketing investment. Which of the following should the data analyst use to BEST communicate this information to the manager?

Options:

A.  

A real-time monitor that allows the manager to view performance the day the campaign was launched

B.  

A sell-service dashboard that allows the manager to look at the company's annual budget performance

C.  

A spreadsheet of the raw data from all marketing campaigns and channels

D.  

A summary with statistics, conclusions, and recommendations from the data analyst

Discussion 0
Questions 25

A JSON file is an example of:

Options:

A.  

structured data.

B.  

web data.

C.  

machine data.

D.  

processed data.

Discussion 0
Questions 26

A data analyst needs to create a weekly recurring report on sales performance and distribute it to all sales managers. Which of the following would be the BEST method to automate and ensure successful delivery for this task?

Options:

A.  

Use scheduled report delivery.

B.  

Implement subscription access delivery.

C.  

Print out a copy.

D.  

Upload the report to the server.

Discussion 0
Questions 27

An analyst needs to provide a chart to identify the composition between the categories of the survey response data set:

Which of the following charts would be BEST to use?

Options:

A.  

Histogram

B.  

Pie

C.  

Line

D.  

Scatter pot

E.  

Waterfall

Discussion 0
Questions 28

An analyst is working with the income data of suburban families in the United States. The data set has a lot of outliers, and the analyst needs to provide a measure that represents the typical income. Which of the following would BEST fulfill the analyst’s goal?

Options:

A.  

Median

B.  

Mean

C.  

Mode

D.  

Standard deviation

Discussion 0
Questions 29

A survey asks participants to rate a company on a scale of one to ten. Which of the following best describes the rating variable?

Options:

A.  

Continuous

B.  

Ordinal

C.  

Categorical

D.  

Nominal

Discussion 0
Questions 30

The number of phone calls that the call center receives in a day is an example of:

Options:

A.  

continuous data.

B.  

categorical data.

C.  

ordinal data.

D.  

discrete data.

Discussion 0
Questions 31

After completing web scraping, which of the following file formats needs to be parsed?

Options:

A.  

.html

B.  

.txt

C.  

.csv

D.  

.tsv

Discussion 0
Questions 32

Kelly wants to get feedback on the final draft of a strategic report that has taken her six months to develop.

What can she do to get prevent confusion as see seeks feedback before publishing the report?

Choose the best answer.

Options:

A.  

Distribute the report to the appropriate stakeholders via email.

B.  

Use a watermark to identify the report as a draft.

C.  

Show the report to her immediate supervisor.

D.  

Publish the report on an internally facing website.

Discussion 0
Questions 33

Which one of the following is a measure of dispersion?

Options:

A.  

Variance.

B.  

Mode.

C.  

Median.

D.  

Mean.

Discussion 0
Questions 34

The ACME Corporation hired an analyst to detect data quality issues in their Excel documents. Which of the following are the most common issues? (Select TWO)

Options:

A.  

Apostrophe.

B.  

Commas.

C.  

Symbols.

D.  

Duplicates.

E.  

Misspellings.

Discussion 0
Questions 35

Which of following is a non-relational database?

Options:

A.  

Neo4j

B.  

SQLite

C.  

MySQL

D.  

PostgreSQL

Discussion 0
Questions 36

A military commander would like to see the health scorecards of the troops daily and filter them based on gender and rank. Considering this data is PHI, which of the following would be the best way for the commander to view the information?

Options:

A.  

An emailed report

B.  

A password-protected dashboard

C.  

A daily printout of a report

D.  

A cloud-hosted spreadsheet

Discussion 0
Questions 37

Which one of the following would not normally be considered a summary statistic?

Options:

A.  

z-score.

B.  

Mean.

C.  

Variance.

D.  

Standard deviation.

Discussion 0
Questions 38

Emma is working in a data warehouse and finds a finance fact table links to an organization dimension, which in turn links to a currency dimension that not linked to the fact table.

What type of design pattern is the data warehouse using?

Options:

A.  

Star.

B.  

Sun.

C.  

Snowflake.

D.  

Comet.

Discussion 0
Questions 39

An analyst modified a data set that had a number of issues. Given the original and modified versions:

Which of the following data manipulation techniques did the analyst use?

Options:

A.  

Imputation

B.  

Recoding

C.  

Parsing

D.  

Deriving

Discussion 0
Questions 40

An analyst needs to provide a chart to identify the composition between the categories of the survey response data set:

Which of the following charts would be BEST to use?

Options:

A.  

Histogram

B.  

Pie

C.  

Line

D.  

Scatter pot

E.  

Waterfall

Discussion 0
Questions 41

Joe. an analyst. tests the loading time on a dashboard he is preparing to go live and finds it is slower than he would like. Which of the following must occur to decrease the loading time?

Options:

A.  

Deploy the dashboard to production.

B.  

Change the field definitions.

C.  

Update the dashboard subscribers.

D.  

Optimize the dashboard.

Discussion 0
Questions 42

An e-commerce company recently tested a new website layout. The website was tested by a test group of customers, and an old website was presented to a control group. The table below shows the percentage of users in each group who made purchases on the websites:

Which of the following conclusions is accurate at a 95% confidence interval?

Options:

A.  

In Germany, the increase in conversion from the new layout was not significant.

B.  

In France, the increase in conversion from the new layout was not significant.

C.  

In general, users who visit the new website are more likely to make a purchase.

D.  

The new layout has the lowest conversion rates in the United Kingdom.

Discussion 0
Questions 43

Which of the following is the correct data type for text?

Options:

A.  

Boolean

B.  

String

C.  

Integer

D.  

Float

Discussion 0
Questions 44

A data analyst is asked to create a sales report for the second-quarter 2020 board meeting, which will include a review of the business’s performance through the second quarter. The board meeting will be held on July 15, 2020, after the numbers are finalized. Which of the following report types should the data analyst create?

Options:

A.  

Static

B.  

Real-time

C.  

Self-service

D.  

Dynamic

Discussion 0
Questions 45

Samantha needs to share a list of her organization's top 50 customers with the VP of sales.

She would like to include the name of the customer, the business they represent, their contact information, and their total sales over the past year.

The VP does not have any specialized analytics skills or software but would like to make some personal notes on the dataset.

What would be the best tool for Samantha to use to share this information?

Options:

A.  

Power BI.

B.  

Microsoft Excel.

C.  

Minitab.

D.  

SAS.

Discussion 0
Questions 46

Which of the following reports can be used when insight into operational performance is needed each Wednesday?

Options:

A.  

Static report

B.  

Tactical report

C.  

Recurring report

D.  

Ad hoc report

Discussion 0
Questions 47

An analyst has been tracking company intranet usage and has been asked to create a chat to show the most-used/most-clicked portions of a homepage that contains more than 30 links. Which of the following visualizations would BEST illustrate this information?

Options:

A.  

Scatter plot

B.  

Heat map

C.  

Pie chart

D.  

Infographic

Discussion 0
Questions 48

Taylor wants to investigate how manufacturing, marketing, and sales expenditures impact overall profitability for her company.

Which of the following systems is the most appropriate?

Options:

A.  

OLTP.

B.  

OLAP.

C.  

Data warehouse.

D.  

Data mart.

Discussion 0
Questions 49

A recurring event is being stored in two databases that are housed in different geographical locations. A data analyst notices the event is being logged three hours earlier in one database than in the other database. Which of the following is the MOST likely cause of the issue?

Options:

A.  

The data analyst is not querying the databases correctly.

B.  

The databases are recording different events.

C.  

The databases are recording the event in different time zones.

D.  

The second database is logging incorrectly.

Discussion 0
Questions 50

Which of the following is a best practice when updating a legacy data source?

Options:

A.  

Placing old data in new fields

B.  

Keeping only the most recent data

C.  

Creating a codebook to document field changes

D.  

Removing the data source from production

Discussion 0
Questions 51

Which of the following contains alphanumeric values?

Options:

A.  

10.1Ε²

B.  

13.6

C.  

1347

D.  

A3J7

Discussion 0
Questions 52

A sales director has requested a report for individual team members within the division be developed. The director would like the report to be shared with all team members, but individual team members should not be identifiable within the report Which of the following access requirements would support the director's needs?

Options:

A.  

Create an acceptable use policy for the sales data.

B.  

Release the report as user-group-based access and include data masking.

C.  

Get a data use agreement from the individual team members.

D.  

Provide the report based on role and include data encryption.

Discussion 0
Questions 53

Which of the following should be accomplished NEXT after understanding a business requirement for a data analysis report?

Options:

A.  

Rephrase the business requirement.

B.  

Determine the data necessary for the analysis

C.  

Build a mock dashboard/presentation layout.

D.  

Perform exploratory data analysis.

Discussion 0
Questions 54

A data analyst wants to create "Income Categories" that would be calculated based on the existing variable "Income". The "Income Categories" would be as follows:

Income category 1: less than $1.

Income category 2: more than $1 and less than $20,000.

Income category 3: more than $20,001 and less than $40,000.

Income category 4: more than $40,001.

Which of the following data manipulation techniques should the data analyst use to create "Income Categories"?

Options:

A.  

Data merge

B.  

Derived variables

C.  

Data blending

D.  

Data append

Discussion 0
Questions 55

Given the following report:

Which of the following components need to be added to ensure the report is point-in-time and static? (Choose two.)

Options:

A.  

A control group for the phrases

B.  

A summary of the KPIs

C.  

Filter buttons for the status

D.  

The date when the report was last accessed

E.  

The time period the report covers

F.  

The date on which the report was run

Discussion 0
Questions 56

Which of the following descriptive statistical methods are measures of central tendency? (Choose two.)

Options:

A.  

Mean

B.  

Minimum

C.  

Mode

D.  

Variance

E.  

Correlation

F.  

Maximum

Discussion 0
Questions 57

Which of the following will MOST likely be streamed live?

Options:

A.  

Machine data

B.  

Key-value pairs

C.  

Delimited rows

D.  

Flat files

Discussion 0
Questions 58

A data analyst is developing a dashboard to track and monitor metrics. Which of the following best practices should be taken into during the FIRST pment process?

Options:

A.  

Create a A Aupirarrame:

B.  

Deploy to production.

C.  

Copy a dashboard design from the Internet.

D.  

Develop a dashboard.

Discussion 0
Questions 59

Analytics reports should follow corporate style guidelines.

Options:

A.  

True.

B.  

False.

Discussion 0
Questions 60

A data analyst needs to create a dashboard using the company's yearly revenue data sets. Which of the following would be the best way to plot the information to show the top-performing region?

Options:

A.  

A line chart

B.  

A waterfall chart

C.  

A heat map

D.  

A stacked bar chart

Discussion 0
Questions 61

Which of the following variable name formats would be problematic if used in the majority of data software programs?

Options:

A.  

First_Name_

B.  

FirstName

C.  

First_Name

D.  

First Name

Discussion 0
Questions 62

Which of the following is a common data analytics tool that is also used as an interpreted, high-level, general-purpose programming language?

Options:

A.  

SAS

B.  

Microsoft Power B1

C.  

IBM SPSS

D.  

Python

Discussion 0
Questions 63

An analyst is working on a project for a director. During this process. the analyst pulled the data. created summarized tables and graphs with descriptions, created a report summary, and inserted all items into a report. After writing the report, which of the following would be the most appropriate next step?

Options:

A.  

Complete an audit on the data pulled for the report.

B.  

Complete a check for quality in the report.

C.  

Complete a review of the data and a check for consistency

D.  

Complete a trend analysis to be included in the report.

Discussion 0
Questions 64

Which of the following is a non-parametric test?

Options:

A.  

One-sample t-test

B.  

Two-way ANOVA

C.  

Correlation coefficient

D.  

Spearman's rank correlation

Discussion 0
Questions 65

Five dogs have the following heights in millimeters:

300, 430, 170, 470, 600

Which of the following is the mean height for the five dogs?

Options:

A.  

394mm

B.  

405mm

C.  

493mm

D.  

504mm

Discussion 0
Questions 66

A data analyst is working with a team to create a dashboard for a client who requires on-demand access. Which of the following is the best delivery method to support the clients’ requirement?

Options:

A.  

Email

B.  

Scheduled

C.  

Subscription

D.  

Static

Discussion 0
Questions 67

Daniel is using the structured Query language to work with data stored in relational database.

He would like to add several new rows to a database table.

What command should he use?

Options:

A.  

SELECT.

B.  

ALTER.

C.  

INSERT.

D.  

UPDATE.

Discussion 0
Questions 68

A data scientist wants to see which products make the most money and which products attract the most customer purchasing interest in their company.

Which of the following data manipulation techniques would he use to obtain this information?

Options:

A.  

Data append

B.  

Data blending

C.  

Normalize data

D.  

Data merge

Discussion 0
Questions 69

Given the following customer and order tables:

Which of the following describes the number of rows and columns of data that would be present after performing an INNER JOIN of the tables?

Options:

A.  

Five rows, eight columns

B.  

Seven rows, eight columns

C.  

Eight rows, seven columns

D.  

Nine rows, five columns

Discussion 0
Questions 70

An analyst has conducted a review of business questions. Which of the following should the analyst do next to conduct an analysis?

Options:

A.  

Determine the data needs and review the observations.

B.  

Determine the data needs and sources for analysis.

C.  

Determine the data needs and schedule interviews.

D.  

Determine the data needs and begin the analysis.

Discussion 0
Questions 71

Which of the following is the best description of the term "data governance"?

Options:

A.  

Data governance governs the development of a data visualization dashboard in an organization.

B.  

Data governance is the policy that protects against data breaches by cybercriminals.

C.  

Data governance is the process of analyzing, manipulating, and reporting data in an organization.

D.  

Data governance is the availability, usability, integrity, and security of data in an enterprise.

Discussion 0
Questions 72

An analyst is designing a dashboard that will provide a story of the sales and sales customer ratio. The following data is available:

Which of the following charts should the analyst consider including in the dashboard?

Options:

A.  

A column chart with site and sales

B.  

A line chart with site and sales

C.  

A pie chart with site and sales

D.  

A scatter chart with site and sales

Discussion 0
Questions 73

A data analyst has been asked to merge the tables below, first performing an INNER JOIN and then a LEFT JOIN:

Customer Table -

In-store Transactions –

Which of the following describes the number of rows of data that can be expected after performing both joins in the order stated, considering the customer table as the main table?

Options:

A.  

INNER: 6 rows; LEFT: 9 rows

B.  

INNER: 9 rows; LEFT: 6 rows

C.  

INNER: 9 rows; LEFT: 15 rows

D.  

INNER: 15 rows; LEFT: 9 rows

Discussion 0
Questions 74

Five dogs have the following heights in millimeters:

300,430, 170, 470, 600

Which of the following is the standard deviation for the five dogs?

Options:

A.  

147mm

B.  

154mm

C.  

394 mm

D.  

21,704mm

Discussion 0
Questions 75

A user imports a data file into the accounts payable system each day. On a regular basis. the field input is not what the system is expecting. so it results in an error for the row and a broken import process. To resolve the issue, the user opens the file, finds the error in the row, and manually corrects it before attempting the import again. The import sometimes breaks on subsequent attempts. though. Which of the following changes should be made to this process to reduce the number of errors?

Options:

A.  

Delete all incorrect inputs and upload the corrected file.

B.  

Have the user manually review the file for data completeness before loading it

C.  

Create a data field to data type validator to run the file through prior to import.

D.  

Spot-check the file prior to import to catch and correct field errors.

Discussion 0
Questions 76

Which of the following best describes how discrete data differs from continuous data?

Options:

A.  

Discrete data cannot create a sloped line.

B.  

Discrete data can only be a finite number of values.

C.  

Discrete data can have decimal points.

D.  

Discrete data applies only to numbers.

Discussion 0
Questions 77

Given the data below:

In which of the following file formats is the data presented?

Options:

A.  

Xs

B.  

CSV

C.  

RIF

D.  

XML

Discussion 0
Questions 78

A research analyst wants to determine whether the data being analyzed is connected to other datapoints. Which of the following is the BEST type of analysis to conduct?

Options:

A.  

Trend analysis

B.  

Performance analysis

C.  

Link analysis

D.  

Exploratory analysis

Discussion 0