Weekend Sale - Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: xmaspas7

Easiest Solution 2 Pass Your Certification Exams

CT-AI ISTQBCertified Tester AI Testing Exam Free Practice Exam Questions (2025 Updated)

Prepare effectively for your ISTQB CT-AI ISTQBCertified Tester AI Testing Exam certification with our extensive collection of free, high-quality practice questions. Each question is designed to mirror the actual exam format and objectives, complete with comprehensive answers and detailed explanations. Our materials are regularly updated for 2025, ensuring you have the most current resources to build confidence and succeed on your first attempt.

Page: 1 / 2
Total 80 questions

A company producing consumable goods wants to identify groups of people with similar tastes for the purpose of targeting different products for each group. You have to choose and apply an appropriate ML type for this problem.

Which ONE of the following options represents the BEST possible solution for this above-mentioned task?

SELECT ONE OPTION

A.

Regression

B.

Association

C.

Clustering

D.

Classification

Before deployment of an AI-based system, a developer is expected to demonstrate in a test environment how decisions are made. Which of the following characteristics does decision making fall under?

A.

Explainability

B.

Autonomy

C.

Self-learning

D.

Non-determinism

A bank wants to use an algorithm to determine which applicants should be given a loan. The bank hires a data scientist to construct a logistic regression model to predict whether the applicant will repay the loan or not. The bank has enough data on past customers to randomly split the data into a training dataset and a test/validation dataset. A logistic regression model is constructed on the training dataset using the following independent variables:

    Gender

    Marital status

    Number of dependents

    Education

    Income

    Loan amount

    Loan term

    Credit score

The model reveals that those with higher credit scores and larger total incomes are more likely to repay their loans. The data scientist has suggested that there might be bias present in the model based on previous models created for other banks.

Given this information, what is the best test approach to check for potential bias in the model?

A.

Experience-based testing should be used to confirm that the training data set is operationally relevant. This can include applying exploratory data analysis (EDA) to check for bias within the training data set.

B.

Back-to-back testing should be used to compare the model created using the training data set to another model created using the test data set. If the two models significantly differ, it will indicate there is bias in the original model.

C.

Acceptance testing should be used to make sure the algorithm is suitable for the customer. The team can re-work the acceptance criteria such that the algorithm is sure to correctly predict the remaining applicants that have been set aside for the validation dataset ensuring no bias is present.

D.

A/B testing should be used to verify that the test data set does not detect any bias that might have been introduced by the original training data. If the two models significantly differ, it will indicate there is bias in the original model.

"AllerEgo" is a product that uses sell-learning to predict the behavior of a pilot under combat situation for a variety of terrains and enemy aircraft formations. Post training the model was exposed to the real-

world data and the model was found to be behaving poorly. A lot of data quality tests had been performed on the data to bring it into a shape fit for training and testing.

Which ONE of the following options is least likely to describes the possible reason for the fall in the performance, especially when considering the self-learning nature of the Al system?

SELECT ONE OPTION

    The difficulty of defining criteria for improvement before the model can be accepted.

    The fast pace of change did not allow sufficient time for testing.

    The unknown nature and insufficient specification of the operating environment might have caused the poor performance.

A.

There was an algorithmic bias in the Al system.

Which ONE of the following options BEST DESCRIBES clustering?

SELECT ONE OPTION

A.

Clustering is classification of a continuous quantity.

B.

Clustering is supervised learning.

C.

Clustering is done without prior knowledge of output classes.

D.

Clustering requires you to know the classes.

Which ONE of the following statements is a CORRECT adversarial example in the context of machine learning systems that are working on image classifiers.

SELECT ONE OPTION

A.

Black box attacks based on adversarial examples create an exact duplicate model of the original.

B.

These attack examples cause a model to predict the correct class with slightly less accuracy even though they look like the original image.

C.

These attacks can't be prevented by retraining the model with these examples augmented to the training data.

D.

These examples are model specific and are not likely to cause another model trained on same task to fail.

Which of the following is a dataset issue that can be resolved using pre-processing?

A.

Insufficient data

B.

Invalid data

C.

Wanted outliers

D.

Numbers stored as strings

Upon testing a model used to detect rotten tomatoes, the following data was observed by the test engineer, based on certain number of tomato images.

For this confusion matrix which combinations of values of accuracy, recall, and specificity respectively is CORRECT?

SELECT ONE OPTION

A.

0.87.0.9. 0.84

B.

1,0.87,0.84

C.

1,0.9, 0.8

D.

0.84.1,0.9

Which ONE of the following tests is MOST likely to describe a useful test to help detect different kinds of biases in ML pipeline?

SELECT ONE OPTION

A.

Testing the distribution shift in the training data for inappropriate bias.

B.

Test the model during model evaluation for data bias.

C.

Testing the data pipeline for any sources for algorithmic bias.

D.

Check the input test data for potential sample bias.

Which ONE of the following options represents a technology MOST TYPICALLY used to implement Al?

SELECT ONE OPTION

A.

Search engines

B.

Procedural programming

C.

Case control structures

D.

Genetic algorithms

An airline has created an ML model to project fuel requirements for future flights. The model imports weather data such as wind speeds and temperatures, calculates flight routes based on historical routings from air traffic control, and estimates loads from average passenger and baggage weights. The model performed within an acceptable standard for the airline throughout the summer but as winter set in, the load weights became less accurate. After some exploratory data analysis, it became apparent that luggage weights were higher in the winter than in summer.

Which of the following statements BEST describes the problem and how it could have been prevented?

A.

The model suffers from drift and therefore should be regularly tested to ensure that any occurrences of drift are detected soon enough for the problem to be mitigated

B.

The model suffers from drift and therefore the performance standard should be eased until a new model with more transparency can be developed

C.

The model suffers from corruption and therefore should be reloaded into the computer system being used, preferably with a method of version control to prevent further changes

D.

The model suffers from a lack of transparency and therefore should be regularly tested to ensure that any progressive errors are detected soon enough for the problem to be mitigated

Which of the following is one of the reasons for data mislabelling?

A.

Lack of domain knowledge

B.

Expert knowledge

C.

Interoperability error

D.

Small datasets

ln the near future, technology will have evolved, and Al will be able to learn multiple tasks by itself without needing to be retrained, allowing it to operate even in new environments. The cognitive abilities of Al are similar to a child of 1-2 years.’

In the above quote, which ONE of the following options is the correct name of this type of Al?

SELECT ONE OPTION

A.

Technological singularity

B.

Narrow Al

C.

Super Al

D.

General Al

Which of the following problems would best be solved using the supervised learning category of regression?

A.

Determining the optimal age for a chicken's egg-laying production using input data of the chicken's age and average daily egg production for one million chickens

B.

Recognizing a knife in carry-on luggage at a security checkpoint in an airport scanner

C.

Determining if an animal is a pig or a cow based on image recognition

D.

Predicting shopper purchasing behavior based on the category of shopper and the positioning of promotional displays within a store

Arihant Meditation is a startup using Al to aid people in deeper and better meditation based on analysis of various factors such as time and duration of the meditation, pulse and blood pressure, EEG patters etc. among others. Their model accuracy and other functional performance parameters have not yet reached their desired level.

Which ONE of the following factors is NOT a factor affecting the ML functional performance?

SELECT ONE OPTION

A.

The data pipeline

B.

The quality of the labeling

C.

Biased data

D.

The number of classes

The stakeholders of a machine learning model have confirmed that they understand the objective and purpose of the model, and ensured that the proposed model aligns with their business priorities. They have also selected a framework and a machine learning model that they will be using. What should be the next step to progress along the machine learning workflow?

A.

Prepare and pre-process the data that will be used to train and test the model

B.

Tune the machine learning algorithm based on objectives and business priorities

C.

Agree on defined acceptance criteria for the machine learning model

D.

Evaluate the selection of the framework and the model

A company is using a spam filter to attempt to identify which emails should be marked as spam. Detection rules are created by the filter that causes a message to be classified as spam. An attacker wishes to have all messages internal to the company be classified as spam. So, the attacker sends messages with obvious red flags in the body of the email and modifies the "from" portion of the email to make it appear that the emails have been sent by company members. The testers plan to use exploratory data analysis (EDA) to detect the attack and use this information to prevent future adversarial attacks.

How could EDA be used to detect this attack?

A.

EDA can help detect the outlier emails from the real emails

B.

EDA can detect and remove the false emails

C.

EDA can restrict how many inputs can be provided by unique users

D.

EDA cannot be used to detect the attack

Consider a machine learning model where the model is attempting to predict if a patient is at risk for stroke. The model collects information on each patient regarding their blood pressure, red blood cell count, smoking status, history of heart disease, cholesterol level, and demographics. Then, using a decision tree the model predicts whether or not the associated patient is likely to have a stroke in the near future. Once the model is created using a training dataset, it is used to predict a stroke in 80 additional patients. The table below shows a confusion matrix on whether or not the model made a correct or incorrect prediction.

The testers have calculated what they believe to be an appropriate functional performance metric for the model. They calculated a value of 0.6667.

Which metric did the testers calculate?

A.

F1-score

B.

Precision

C.

Recall

D.

Accuracy

Which ONE of the following models BEST describes a way to model defect prediction by looking at the history of bugs in modules by using code quality metrics of modules of historical versions as input?

SELECT ONE OPTION

A.

Identifying the relationship between developers and the modules developed by them.

B.

Search of similar code based on natural language processing.

C.

Clustering of similar code modules to predict based on similarity.

D.

Using a classification model to predict the presence of a defect by using code quality metrics as the input data.

Which of the following are the three activities in the data acquisition activities for data preparation?

A.

Cleaning, transforming, augmenting

B.

Feature selecting, feature growing, feature augmenting

C.

Identifying, gathering, labelling

D.

Building, approving, deploying

Page: 1 / 2
Total 80 questions
Copyright © 2014-2025 Solution2Pass. All Rights Reserved