

CT-AI ISTQB Certified Tester AI Testing Exam Free Practice Exam Questions (2026 Updated)

Prepare for the ISTQB Certified Tester AI Testing (CT-AI) certification exam with our collection of free practice questions. Each question is designed to mirror the actual exam format and objectives, and comes with an answer and a detailed explanation. Our materials are updated for 2026, so you have current resources to build confidence and succeed on your first attempt.

Page: 2 / 2
Total 120 questions

Which ONE of the following describes a situation of back-to-back testing the LEAST?

SELECT ONE OPTION

A.

Comparison of the results of a current neural network ML model implemented on platform A (for example, PyTorch) with a similar neural network ML model implemented on platform B (for example, TensorFlow), for the same data.

B.

Comparison of the results of a home-grown neural network ML model with the results of a neural network model built on a standard implementation (for example, PyTorch), for the same data.

C.

Comparison of the results of a neural network ML model with a current decision tree ML model for the same data.

D.

Comparison of the results of the current neural network ML model on the current data set with a slightly modified data set.
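For readers who want a concrete picture of the technique this question refers to, here is a minimal sketch of back-to-back testing: two independently written implementations of the same function are run on identical inputs and their outputs are compared within a tolerance. The function names, the sigmoid example, and the tolerance are illustrative assumptions and are not taken from the exam material.

```python
import math


def sigmoid_reference(x: float) -> float:
    """Reference implementation (hypothetical system under test A)."""
    return 1.0 / (1.0 + math.exp(-x))


def sigmoid_alternative(x: float) -> float:
    """Independent implementation of the same function via the tanh identity."""
    return 0.5 * (1.0 + math.tanh(0.5 * x))


def back_to_back(impl_a, impl_b, inputs, tolerance=1e-9):
    """Run both implementations on the same inputs and collect disagreements."""
    mismatches = []
    for x in inputs:
        a, b = impl_a(x), impl_b(x)
        if abs(a - b) > tolerance:
            mismatches.append((x, a, b))
    return mismatches


inputs = [-5.0, -1.0, 0.0, 0.5, 3.0]

# The two implementations should agree on every input.
mismatches = back_to_back(sigmoid_reference, sigmoid_alternative, inputs)

# A deliberately wrong implementation acts as a pseudo-oracle check:
# it only agrees with the reference at x == 0.0.
faulty_mismatches = back_to_back(sigmoid_reference, lambda x: 0.5, inputs)

print(len(mismatches), len(faulty_mismatches))
```

In this sketch neither implementation is assumed to be correct on its own; a disagreement only signals that at least one of them needs investigation.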

Which statement about automation bias is correct?

Choose ONE option (1 out of 4)

A.

When testing AI-based systems, automation bias does not play a role in supporting test activities such as boundary value analysis

B.

Automation bias affects the testing of AI-based systems that support users in their actions or decisions

C.

Automation bias particularly affects testing of autonomous systems

D.

Automation bias is tested with representative users, but human input quality is irrelevant

You are testing an autonomous vehicle which uses AI to determine proper driving actions and responses. You have evaluated the parameters and combinations to be tested and have determined that there are too many to test in the time allowed. It has been suggested that you use pairwise testing to limit the parameters. Given the complexity of the software under test, what is likely the outcome from using pairwise testing?

A.

The number of parameters to test can be reduced to less than a dozen

B.

All high priority defects will be identified using this method

C.

While the number of tests needed can be reduced, there may still be a large enough set of tests that automation will be required to execute all of them

D.

Pairwise cannot be applied to this problem because there is AI involved and the evolving values may result in unexpected results that cannot be verified
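As background for the technique named in this question, the following is a rough sketch of a greedy pairwise (all-pairs) test generator. The driving parameters and their values are invented for illustration, and the greedy strategy is only one simple way to build such a suite; dedicated tools may produce smaller ones.

```python
from itertools import combinations, product


def pairwise_suite(parameters):
    """Greedily pick full test cases until every value pair is covered."""
    names = list(parameters)

    def pairs_of(test):
        # All (parameter, value) pairs exercised by one concrete test case.
        return {((a, test[a]), (b, test[b])) for a, b in combinations(names, 2)}

    # Exhaustive set of candidate test cases (the full Cartesian product).
    candidates = [dict(zip(names, values))
                  for values in product(*(parameters[n] for n in names))]

    # Every value pair that must appear in at least one chosen test.
    uncovered = set()
    for candidate in candidates:
        uncovered |= pairs_of(candidate)

    suite = []
    while uncovered:
        # Choose the candidate that covers the most still-uncovered pairs.
        best = max(candidates, key=lambda t: len(pairs_of(t) & uncovered))
        suite.append(best)
        uncovered -= pairs_of(best)
    return suite


# Hypothetical parameters for a driving scenario (assumed, not from the source).
parameters = {
    "weather": ["clear", "rain", "fog"],
    "road": ["highway", "urban", "rural"],
    "traffic": ["light", "medium", "heavy"],
    "time_of_day": ["day", "night"],
}

suite = pairwise_suite(parameters)
full_count = 3 * 3 * 3 * 2  # size of the exhaustive combination set

print(f"{len(suite)} pairwise tests instead of {full_count} exhaustive tests")
```

Even with only four small parameters the exhaustive set has 54 combinations; with the many parameters of a real system, the pairwise suite is far smaller than the exhaustive set but can still be large.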

A software component uses machine learning to recognize the digits from a scan of handwritten numbers. In the scenario above, which type of Machine Learning (ML) is this an example of?

SELECT ONE OPTION

A.

Reinforcement learning

B.

Regression

C.

Classification

D.

Clustering

Which ONE of the following tests is LEAST likely to be performed during the ML model testing phase?

SELECT ONE OPTION

A.

Testing the accuracy of the classification model.

B.

Testing the API of the service powered by the ML model.

C.

Testing the speed of the training of the model.

D.

Testing the speed of the prediction by the model.

The training of an ML model… What type of bias is LEAST important to look for when testing the model?

Choose ONE option (1 out of 4)

A.

Inappropriate bias

B.

Automation bias

C.

Algorithmic bias

D.

Sample bias

Which of the following statements about reinforcement learning is correct?

Choose ONE option (1 out of 4)

A.

The agent creates a model of the environment from labeled data during training

B.

The approach is suitable when the application does not require interaction with the environment

C.

The agent’s training is based on a reward function that rewards successful attempts

D.

From experience, the agent learns the optimal reward function

Which of the following is a dataset issue that can be resolved using pre-processing?

A.

Insufficient data

B.

Invalid data

C.

Wanted outliers

D.

Numbers stored as strings

Which supervised-learning classification/regression statement is correct?

Choose ONE option (1 out of 4)

A.

Recognizing a dog from many different images is a regression problem

B.

Deciding whether an object is a bicycle or a motorcycle is a classification problem

C.

Predicting that diesel prices will increase by ~10% is a classification problem

D.

In classification, objects are always assigned to exactly two classes

Which of the following statements about the structure and function of neural networks is true?

Choose ONE option (1 out of 4)

A.

The bias of a neuron is determined by the activation values of the neurons in the previous layer

B.

Training a neural network only changes the values of the weights at the connections between neurons

C.

A single-layer perceptron is NOT a neural network

D.

The input layer of a deep neural network must have at least as many neurons as its output layer

Consider an AI-system in which the complex internal structure has been generated by another software system. Why would the tester choose to do black-box testing on this particular system?

A.

Test automation can be built quickly and easily from the test cases developed during black-box testing

B.

The tester wishes to better understand the logic of the software used to create the internal structure

C.

The black-box testing method will allow the tester to check the transparency of the algorithm used to create the internal structure

D.

Black-box testing eliminates the need for the tester to understand the internal structure of the AI-system

There is a growing backlog of unresolved defects for your project. You know the developers have an ML model that they have created which has learned which developers work on which type of software and the speed with which they resolve issues. How could you use this model to help reduce the backlog and implement more efficient defect resolution?

A.

Use it to prioritize defects automatically based on the time expected for the fix to be made, the speed of the fix, and the likelihood of regressions

B.

Use it to assign defects to the best developer to resolve the problem and to load balance the defect assignments among the developers

C.

Use it to determine the root cause of each defect and develop a process improvement plan that can be implemented to remove the most common root causes

D.

Use it to review the code and determine where more defects are likely to occur so that testing can be targeted to those areas

Which statement about testing to prevent data poisoning and adversarial attacks is correct?

Choose ONE option (1 out of 4)

A.

Regression testing can be used to verify data sourcing policies to ensure the source of training data.

B.

The adversarial examples identified during adversarial testing must not be added to the training data so that they do not poison the model.

C.

Adversarial testing consists of using adversarial attacks to identify vulnerabilities so that they can be eliminated.

D.

Using AIB testing to identify data poisoning can better identify outliers than exploratory data analysis.

Which ONE of the following is the LEAST appropriate test to perform when testing a feature related to autonomy?

SELECT ONE OPTION

A.

Test for human handover to give rest to the system.

B.

Test for human handover when it should actually not be relinquishing control.

C.

Test for human handover requiring mandatory relinquishing control.

D.

Test for human handover after a given time interval.

"AllerEgo" is a product that uses self-learning to predict the behavior of a pilot in combat situations for a variety of terrains and enemy aircraft formations. After training, the model was exposed to real-world data and was found to perform poorly. Many data quality tests had been performed on the data to make it fit for training and testing.

Which ONE of the following options is LEAST likely to describe a possible reason for the fall in performance, especially when considering the self-learning nature of the AI system?

SELECT ONE OPTION

A.

The difficulty of defining criteria for improvement before the model can be accepted.

B.

The fast pace of change did not allow sufficient time for testing.

C.

The unknown nature and insufficient specification of the operating environment might have caused the poor performance.

D.

There was an algorithmic bias in the AI system.

A bank wants to use an algorithm to determine which applicants should be given a loan. The bank hires a data scientist to construct a logistic regression model to predict whether the applicant will repay the loan or not. The bank has enough data on past customers to randomly split the data into a training dataset and a test/validation dataset. A logistic regression model is constructed on the training dataset using the following independent variables:

    Gender

    Marital status

    Number of dependents

    Education

    Income

    Loan amount

    Loan term

    Credit score

The model reveals that those with higher credit scores and larger total incomes are more likely to repay their loans. The data scientist has suggested that there might be bias present in the model based on previous models created for other banks.

Given this information, what is the best test approach to check for potential bias in the model?

A.

Experience-based testing should be used to confirm that the training data set is operationally relevant. This can include applying exploratory data analysis (EDA) to check for bias within the training data set.

B.

Back-to-back testing should be used to compare the model created using the training data set to another model created using the test data set. If the two models significantly differ, it will indicate there is bias in the original model.

C.

Acceptance testing should be used to make sure the algorithm is suitable for the customer. The team can re-work the acceptance criteria such that the algorithm is sure to correctly predict the remaining applicants that have been set aside for the validation dataset ensuring no bias is present.

D.

A/B testing should be used to verify that the test data set does not detect any bias that might have been introduced by the original training data. If the two models significantly differ, it will indicate there is bias in the original model.

Copyright © 2014-2026 Solution2Pass. All Rights Reserved