CT-AI ISTQB Certified Tester AI Testing Exam Free Practice Exam Questions (2026 Updated)

Prepare effectively for your ISTQB CT-AI ISTQB Certified Tester AI Testing Exam certification with our extensive collection of free, high-quality practice questions. Each question is designed to mirror the actual exam format and objectives, complete with comprehensive answers and detailed explanations. Our materials are regularly updated for 2026, ensuring you have the most current resources to build confidence and succeed on your first attempt.

ISTQB CT-AI Premium Access Download Demo

Page: 2 / 2
Total 120 questions

Question # 26

Which ONE of the following describes a situation of back-to-back testing the LEAST?

SELECT ONE OPTION

Comparison of the results of a current neural network model ML model implemented in platform A (for example Pytorch) with a similar neural network model ML model implemented in platform B (for example Tensorflow), for the same data.

Comparison of the results of a home-grown neural network model ML model with results in a neural network model implemented in a standard implementation (for example Pytorch) for same data

Comparison of the results of a neural network ML model with a current decision tree ML model for the same data.

Comparison of the results of the current neural network ML model on the current data set with a slightly modified data set.

Explanation:

Back-to-back testing is a method where the same set of tests are run on multiple implementations of the system to compare their outputs. This type of testing is typically used to ensure consistency and correctness by comparing the outputs of different implementations under identical conditions. Let's analyze the options given:

A. Comparison of the results of a current neural network model ML model implemented in platform A (for example Pytorch) with a similar neural network model ML model implemented in platform B (for example Tensorflow), for the same data.

This option describes a scenario where two different implementations of the same type of model are being compared using the same dataset. This is a typical back-to-back testing situation.

B. Comparison of the results of a home-grown neural network model ML model with results in a neural network model implemented in a standard implementation (for example Pytorch) for the same data.

This option involves comparing a custom implementation with a standard implementation, which is also a typical back-to-back testing scenario to validate the custom model against a known benchmark.

C. Comparison of the results of a neural network ML model with a current decision tree ML model for the same data.

This option involves comparing two different types of models (a neural network and a decision tree). This is not a typical scenario for back-to-back testing because the models are inherently different and would not be expected to produce identical results even on the same data.

D. Comparison of the results of the current neural network ML model on the current data set with a slightly modified data set.

This option involves comparing the outputs of the same model on slightly different datasets. This could be seen as a form of robustness testing or sensitivity analysis, but not typical back-to-back testing as it doesn’t involve comparing multiple implementations.

Based on this analysis, optionCis the one that describes a situation of back-to-back testing the least because it compares two fundamentally different models, which is not the intent of back-to-back testing.

Question # 27

Which statement about automation bias is correct?

Choose ONE option (1 out of 4)

When testing AI-based systems, automation bias does not play a role in supporting test activities such as boundary value analysis

Automation bias affects the testing of AI-based systems that support users in their actions or decisions

Automation bias particularly affects testing of autonomous systems

Automation bias is tested with representative users, but human input quality is irrelevant

Question # 28

You are testing an autonomous vehicle which uses AI to determine proper driving actions and responses. You have evaluated the parameters and combinations to be tested and have determined that there are too many to test in the time allowed. It has been suggested that you use pairwise testing to limit the parameters. Given the complexity of the software under test, what is likely the outcome from using pairwise testing?

The number of parameters to test can be reduced to less than a dozen

All high priority defects will be identified using this method

While the number of tests needed can be reduced, there may still be a large enough set of tests that automation will be required to execute all of them

Pairwise cannot be applied to this problem because there is AI involved and the evolving values may result in unexpected results that cannot be verified

Question # 29

A software component uses machine learning to recognize the digits from a scan of handwritten numbers. In the scenario above, which type of Machine Learning (ML) is this an example of?

SELECT ONE OPTION

Reinforcement learning

Regression

Classification

Clustering

Question # 30

Which ONE of the following tests is LEAST likely to be performed during the ML model testing phase?

SELECT ONE OPTION

Testing the accuracy of the classification model.

Testing the API of the service powered by the ML model.

Testing the speed of the training of the model.

Testing the speed of the prediction by the model.

Question # 31

The training of an ML model… What type of bias is LEAST important to look for when testing the model?

Choose ONE option (1 out of 4)

Inappropriate bias

Automation bias

Algorithmic bias

Sample bias

Question # 32

Which of the following statements about reinforcement learning is correct?

Choose ONE option (1 out of 4)

The agent creates a model of the environment from labeled data during training

The approach is suitable when the application doesnotrequire interaction with the environment

The agent’s training is based on a reward function that rewards successful attempts

From experience, the agent learns theoptimal reward function

Question # 33

Which of the following is a dataset issue that can be resolved using pre-processing?

Insufficient data

Invalid data

Wanted outliers

Numbers stored as strings

Question # 34

Which supervised-learning classification/regression statement is correct?

Choose ONE option (1 out of 4)

Recognizing a dog from many different images is a regression problem

Deciding whether an object is a bicycle or a motorcycle is a classification problem

Predicting that diesel prices will increase by ~10% is a classification problem

In classification, objects are always assigned to exactly two classes

Question # 35

Which of the following statements about the structure and function of neural networks is true?

Choose ONE option (1 out of 4)

The bias of a neuron is determined by the activation values of the neurons in the previous layer

Training a neural network only changes the values of the weights at the connections between neurons

A single-layer perceptron is NOT a neural network

The input layer of a deep neural network must have at least as many neurons as its output layer

Question # 36

Consider an AI-system in which the complex internal structure has been generated by another software system. Why would the tester choose to do black-box testing on this particular system?

Test automation can be built quickly and easily from the test cases developed during black-box testing

The tester wishes to better understand the logic of the software used to create the internal structure

The black-box testing method will allow the tester to check the transparency of the algorithm used to create the internal structure

Black-box testing eliminates the need for the tester to understand the internal structure of the AI-system

Question # 37

There is a growing backlog of unresolved defects for your project. You know the developers have an ML model that they have created which has learned which developers work on which type of software and the speed with which they resolve issues. How could you use this model to help reduce the backlog and implement more efficient defect resolution?

Use it to prioritize defects automatically based on the time expected for the fix to be made, the speed of the fix, and the likelihood of regressions

Use it to assign defects to the best developer to resolve the problem and to load balance the defect assignments among the developers

Use it to determine the root cause of each defect and develop a process improvement plan that can be implemented to remove the most common root causes

Use it to review the code and determine where more defects are likely to occur so that testing can be targeted to those areas

Question # 38

Which statement about testing to prevent data poisoning and adversarial attacks is correct?

Choose ONE option (1 out of 4)

Regression testing can be used to verify data sourcing policies to ensure the source of training data.

The adversarial examples identified during adversarial testing must not be added to the training data so that they do not poison the model.

Adversarial testing consists of using adversarial attacks to identify vulnerabilities so that they can be eliminated.

Using AIB testing to identify data poisoning can better identify outliers than exploratory data analysis.

Question # 39

Which of the following is THE LEAST appropriate tests to be performed for testing a feature related to autonomy?

SELECT ONE OPTION

Test for human handover to give rest to the system.

Test for human handover when it should actually not be relinquishing control.

Test for human handover requiring mandatory relinquishing control.

Test for human handover after a given time interval.

Question # 40

"AllerEgo" is a product that uses sell-learning to predict the behavior of a pilot under combat situation for a variety of terrains and enemy aircraft formations. Post training the model was exposed to the real-

world data and the model was found to be behaving poorly. A lot of data quality tests had been performed on the data to bring it into a shape fit for training and testing.

Which ONE of the following options is least likely to describes the possible reason for the fall in the performance, especially when considering the self-learning nature of the Al system?

SELECT ONE OPTION

The difficulty of defining criteria for improvement before the model can be accepted.

The fast pace of change did not allow sufficient time for testing.

The unknown nature and insufficient specification of the operating environment might have caused the poor performance.

There was an algorithmic bias in the Al system.

Question # 41

A bank wants to use an algorithm to determine which applicants should be given a loan. The bank hires a data scientist to construct a logistic regression model to predict whether the applicant will repay the loan or not. The bank has enough data on past customers to randomly split the data into a training dataset and a test/validation dataset. A logistic regression model is constructed on the training dataset using the following independent variables:

Gender

Marital status

Number of dependents

Education

Income

Loan amount

Loan term

Credit score

The model reveals that those with higher credit scores and larger total incomes are more likely to repay their loans. The data scientist has suggested that there might be bias present in the model based on previous models created for other banks.

Given this information, what is the best test approach to check for potential bias in the model?

Experience-based testing should be used to confirm that the training data set is operationally relevant. This can include applying exploratory data analysis (EDA) to check for bias within the training data set.

Back-to-back testing should be used to compare the model created using the training data set to another model created using the test data set. If the two models significantly differ, it will indicate there is bias in the original model.

Acceptance testing should be used to make sure the algorithm is suitable for the customer. The team can re-work the acceptance criteria such that the algorithm is sure to correctly predict the remaining applicants that have been set aside for the validation dataset ensuring no bias is present.

A/B testing should be used to verify that the test data set does not detect any bias that might have been introduced by the original training data. If the two models significantly differ, it will indicate there is bias in the original model.

ISTQB CT-AI Premium Access Download Demo

Page: 2 / 2
Total 120 questions

Spring Sale Special - Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: xmaspas7

CT-AI ISTQB Certified Tester AI Testing Exam Free Practice Exam Questions (2026 Updated)

The Answer Is:

Explanation:

The Answer Is:

Explanation:

The Answer Is:

Explanation:

The Answer Is:

Explanation:

The Answer Is:

Explanation:

The Answer Is:

Explanation:

The Answer Is:

Explanation:

The Answer Is:

Explanation:

The Answer Is:

Explanation:

The Answer Is:

Explanation:

The Answer Is:

Explanation:

The Answer Is:

Explanation:

The Answer Is:

Explanation:

The Answer Is:

Explanation:

The Answer Is:

Explanation:

The Answer Is:

Explanation: