Beware data bias in AI models
Insurers should be aware of the risks of data bias associated with artificial intelligence (AI) models. Atreyee Bhattacharyya looks at some of these risks, particularly the ethical considerations, and how actuaries can address them.
The use of advanced analytics techniques and machine learning models in insurance has increased significantly over the past few years. It's an exciting time for actuaries and an opportunity to innovate. Leading insurers in this area are deriving better insights and greater predictive power, ultimately leading to better performance.
However, every new technology brings new risks. With AI, such risks could be material in terms of regulatory implications, litigation, public perception and reputation.
Why data bias in AI models matters
The ethical risks associated with data bias are not unique to AI models, but data bias is a more acute concern in AI because:
- AI models make predictions based on patterns in data, without assuming any particular statistical distribution. Because these models learn from historical data, any biases in the training data can be perpetuated by the AI system, leading to biased outcomes and unfair treatment of certain groups or individuals.
For instance, a tech giant had to abandon the trial of an AI recruitment system when it was found to discriminate against women applying for technical roles. The model had been trained on several years of historical applications and, because the majority of those roles had historically been held by men, the algorithm learned to undervalue applications from women (the sketch after this list illustrates the mechanism).
Furthermore, AI models can inadvertently reinforce biases present in society or in existing practices. If historical data reflects biased human decisions, the AI model may learn and perpetuate those biases, creating a feedback loop in which biased AI outcomes further entrench the original bias. Non-AI models are typically less susceptible to this loop, as they do not usually learn and adapt over time.
- AI models can process vast amounts of data quickly, enabling decisions and predictions at scale and in real time. This amplifies the potential impact of any biases in the data, particularly where human oversight is missing or reduced.
- AI models can be highly complex and opaque, making it challenging to understand how they arrive at decisions. This lack of transparency can make it difficult to detect and address biases within the models. In contrast, non-AI models, such as traditional rule-based systems or models based on statistical distributions, are often more transparent, allowing humans to directly inspect and understand the decision-making process.
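To see the first of these mechanisms concretely, consider the stylised sketch below. It uses synthetic data and scikit-learn (an assumed toolchain; any comparable library would do): a classifier trained on historically biased hiring decisions reproduces the penalty against one group, and retraining on its own outputs keeps that penalty in place.

```python
# A stylised sketch of bias perpetuation, using synthetic data.
# All features, group labels and effect sizes are illustrative assumptions.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
n = 10_000

skill = rng.normal(0.0, 1.0, n)      # identically distributed in both groups
group = rng.integers(0, 2, n)        # 1 = historically disadvantaged group

# Historical decisions: driven by skill, but with a penalty on group 1.
# This penalty is the human bias baked into the training labels.
logit = 1.5 * skill - 1.0 * group
hired = rng.random(n) < 1.0 / (1.0 + np.exp(-logit))

X = np.column_stack([skill, group])
model = LogisticRegression().fit(X, hired)

# Two candidates with identical skill: the model reproduces the penalty.
print(model.predict_proba([[0.5, 0], [0.5, 1]])[:, 1])

# Stylised feedback loop: retraining on the model's own decisions keeps
# the learned penalty in place, with no new human bias introduced.
for _ in range(3):
    model = LogisticRegression().fit(X, model.predict(X))
print(model.coef_)  # the group coefficient remains negative
```

Note that simply dropping the group column rarely helps: correlated proxy variables can carry the same signal into the model.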
Given these factors, data bias is a particularly critical concern in AI, and addressing and mitigating it is crucial to ensuring fair and ethical outcomes.
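As a practical first step towards addressing it, teams can quantify bias with simple group-level checks. A minimal sketch, assuming model predictions, true outcomes and a protected attribute are available as arrays:

```python
# A minimal, illustrative bias check: compare outcome rates and accuracy
# across protected groups. The arrays are assumed to come from an
# existing modelling pipeline.
import numpy as np

def bias_report(y_true: np.ndarray, y_pred: np.ndarray, group: np.ndarray) -> None:
    """Print the approval rate and accuracy for each protected group."""
    for g in np.unique(group):
        mask = group == g
        rate = y_pred[mask].mean()                    # approval rate
        acc = (y_pred[mask] == y_true[mask]).mean()   # group-wise accuracy
        print(f"group {g}: approval rate {rate:.2f}, accuracy {acc:.2f}")
```

Large gaps between groups on either measure are a prompt for investigation; more formal fairness criteria, such as equalised odds or calibration within groups, build on the same comparison.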
What are the various kinds of data biases?
Selection bias arises when certain samples are systematically overrepresented or underrepresented in the training data. This can occur if data-collection processes inadvertently favour certain groups or exclude others; as a result, the AI model may be more accurate or effective for the overrepresented groups. Similarly, if the training data does not adequately capture the diversity of the target population, the model may not generalise well and could make inaccurate or unfair predictions. This might happen if, for example, an Asian health insurer bases its pricing on an AI model trained predominantly on health-metrics data from Western populations; the resulting prices are unlikely to be accurate or fair.
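To make the health-pricing example concrete, here is a stylised sketch in the same vein (synthetic data; the thresholds and effect sizes are illustrative assumptions, though risk thresholds for metrics such as BMI do genuinely differ across populations):

```python
# A hypothetical illustration of selection bias: a pricing model trained
# mostly on one population is systematically miscalibrated for an
# underrepresented one. All parameters below are illustrative assumptions.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(1)

def population(n, threshold):
    """Synthetic data: claim risk rises with BMI above a population-specific
    threshold, so a single global risk curve is a modelling error."""
    bmi = rng.normal(26, 4, size=n)
    p_claim = 1 / (1 + np.exp(-0.4 * (bmi - threshold)))
    return bmi.reshape(-1, 1), rng.random(n) < p_claim

X_a, y_a = population(9_500, threshold=30)   # dominant population
X_b, y_b = population(500, threshold=26)     # underrepresented population

model = LogisticRegression().fit(np.vstack([X_a, X_b]),
                                 np.concatenate([y_a, y_b]))

# The model tracks population A well but underestimates risk for B.
for name, X, y in [("A", X_a, y_a), ("B", X_b, y_b)]:
    predicted = model.predict_proba(X)[:, 1].mean()
    print(f"population {name}: actual {y.mean():.2f}, predicted {predicted:.2f}")
```

Because population B is barely represented in training, the fitted curve tracks population A, and the model systematically understates B's claim risk: exactly the kind of inaccurate and unfair pricing described above.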


