Development Phase

The development phase of FUTURE-AI framework focuses on implementing the design specifications and requirements through five key recommendations that emphasize data collection, privacy, and technical implementation. Table 4 provides a comprehensive breakdown of these development phase recommendations, detailing specific operations and examples for each recommendation, including representative data collection (robustness 2), collection of individual attributes (fairness 2), data privacy and security measures (general 2), risk mitigation implementation (general 3), and human-AI interaction mechanisms (usability 2), with varying compliance requirements between research (+) and deployable (++) AI tools.

The framework emphasizes stronger compliance requirements for deployable AI tools compared to research applications during the development phase, particularly in areas like human-AI interactions and oversight, reflecting the higher standards needed for real-world implementation. Only robustness 2 (training with representative real-world data) and general measures for privacy, security and risk mitigation are highly recommended (++) for both research and deployable tools.

Practical steps and examples to implement FUTURE-AI recommendations during development phase

Recommendations	Operations	Examples
Collect representative training dataset (robustness 2)	Collect training data that reflect the demographic variations	According to age, sex, ethnicity, socioeconomics
	Collect training data that reflect the clinical variations	Disease subgroups, treatment protocols, clinical outcomes, rare cases
	Collect training data that reflect variations in real world practice	Data acquisition protocols, data annotations, medical equipment, operational variations (eg, patient motion during scanning)
	Artificially enhance the training data to mimic real world conditions	Data augmentation, data synthesis (eg, low quality data, noise addition), data harmonisation, data homogenisation
Collect information on individuals’ and data attributes (fairness 2)	Request approval for collecting data on personal attributes	Sex, age, ethnicity, socioeconomic status
	Collect information on standard attributes of the individuals (if available and allowed)	Sex, age, nationality, education
	Include application specific information relevant for fairness analysis	Skin colour, breast density, presence of implants, comorbidity
	Estimate data distributions across subgroups	Male v female, across ethnic groups
	Collect information on data provenance	Data centres, equipment characteristics, data preprocessing, annotation processes
Implement measures for data privacy and security (general 2)	Implement measures to ensure data privacy and security	Data deidentification, federated learning, differential privacy, encryption[
	Implement measures against malicious attacks	Firewalls, intrusion detection systems, regular security audits
	Adhere to applicable data protection regulations	General Data Protection Regulation, Health Insurance Portability and Accountability Act
	Define suitable data governance mechanisms	Access control, logging system
Implement measures against identified AI risks (general 3)	Implement a baseline AI model and identify its limitations	Bias, lack of generalisability
	Implement methods to enhance robustness to real world variations (if needed)	Regularisation, data augmentation, data harmonisation, domain adaptation
	Implement methods to enhance generalisability across settings (if needed)	Regularisation, transfer learning, knowledge distillation
	Implement methods to enhance fairness across subgroups (if needed)	Data resampling, bias free representation, equalised odds postprocessing
Establish mechanisms for human-AI interactions (usability 2)	Implement mechanisms to standardise data preprocessing and labelling	Data preprocessing pipeline, data labelling plugin
	Implement an interface for using the AI model	Application programming interface
	Implement interfaces for explainable AI	Visual explanations, heatmaps, feature importance bars
	Implement mechanisms for user centred quality control of the AI results	Visual quality control, uncertainty estimation
	Implement mechanism for user feedback	Feedback interface