Reliable Machine Learning

May 16, 2023

Current Machine Learning model evaluation methods, e.g., the use of test sets, will only detect whether a model’s predictions match the data. They cannot exclude the possibility that both predictions and data are biased. More targeted efforts to reduce or eliminate training data biases require either manual adjustments of the models, domain knowledge, or assume that data or model issues can easily be identified. This is seldom the case. Model auditing generally relies on manual analysis of the models, on existing data, and knowledgeable auditors. Automatically identifying deficiencies in both data and training, and how they impact application of models, is still an unsolved question. Bias mitigation approaches require either the ground-truth distribution or concrete information on the bias — or unbiased / differently biased data from other sources. We aim to develop a model-agnostic framework that will not be limited by any of these requirements.

2026

Dost, Katharina; Muraoka, Kohji; Ausseil, Anne-Gaelle; Benavidez, Rubianca; Blue, Brendon; Conland, Nic; Daughney, Chris; Semadeni-Davies, Annette; Hoang, Linh; Hooper, Anna; Kpodonu, Theodore Alfred; Marapara, Tapuwa; McDowell, Richard; Nguyen, Trung; Nguyet, Dang Anh; Norton, Ned; Özkundakci, Deniz; Pearson, Lisa; Rolinson, James; Smith, Ra; Stephens, Tom; Tamepo, Reina; Taylor, Ken; Uitregt, Vincent; Jackson, Bethanna; Sarris, Theo; Elliott, Alexander; Wicker, Jörg

Freshwater modeling in Aotearoa New Zealand: Current practice and future directions Journal Article

In: Environmental Modelling & Software, vol. 197, pp. 106820, 2026, ISSN: 1364-8152.

Abstract | Links | BibTeX

@article{DOST2026106820,

title = {Freshwater modeling in Aotearoa New Zealand: Current practice and future directions},

author = {Katharina Dost and Kohji Muraoka and Anne-Gaelle Ausseil and Rubianca Benavidez and Brendon Blue and Nic Conland and Chris Daughney and Annette Semadeni-Davies and Linh Hoang and Anna Hooper and Theodore Alfred Kpodonu and Tapuwa Marapara and Richard McDowell and Trung Nguyen and Dang Anh Nguyet and Ned Norton and Deniz \"{O}zkundakci and Lisa Pearson and James Rolinson and Ra Smith and Tom Stephens and Reina Tamepo and Ken Taylor and Vincent Uitregt and Bethanna Jackson and Theo Sarris and Alexander Elliott and J\"{o}rg Wicker},

url = {https://www.sciencedirect.com/science/article/pii/S1364815225005043},

doi = {10.1016/j.envsoft.2025.106820},

issn = {1364-8152},

year  = {2026},

date = {2026-01-01},

urldate = {2026-01-01},

journal = {Environmental Modelling \& Software},

volume = {197},

pages = {106820},

abstract = {Freshwater modeling is vital for addressing environmental and societal challenges. In two workshops preceding this article, we revealed issues in current modeling practices in New Zealand, with a focus on catchment-level water quality modelling. Predominant were low trust in models, lack of transparency, and models unfit for purpose. This article uses a root-cause analysis to explore these issues, identify causes, and propose solutions. We find that current best practices and research are a good foundation but insufficient to fulfill our freshwater research and management needs. We advocate for long-term national strategies with centralized funding, standardized documentation, data, models, evaluation techniques, and communication methods, along with a centralized open-access platform for collaboration. Our vision is to streamline modeling projects, enhance the accessibility and reliability of models, and foster more effective decision-making processes for the sustainable management of freshwater ecosystems.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

2025

Dai, Kejun; Kim, Jonathan; Džeroski, Sašo; Wicker, Jörg; Dobbie, Gillian; Dost, Katharina

Assessing the risk of discriminatory bias in classification datasets Journal Article

In: Machine Learning, vol. 114, iss. 9, pp. 204, 2025, ISSN: 1573-0565.

Abstract | Links | BibTeX

Dost, Katharina; Muraoka, Kohji; Ausseil, Anne-Gaelle; Benavidez, Rubianca; Blue, Brendan; Coland, Nic; Daughney, Chris; Semadeni-Davies, Annette; Hoang, Linh; Hooper, Anna; Kpodonu, Theodore Alfred; Marapara, Tapuwa; McDowell, Richard W.; Nguyen, Trung; Nguyet, Dang Anh; Norton, Ned; Özkundakci, Deniz; Pearson, Lisa; Rolinson, James; Smith, Ra; Stephens, Tom; Tamepo, Reina; Taylor, Ken; van Uitregt, Vincent; Jackson, Bethanna; Sarris, Theo; Elliott, Alexander; Wicker, Jörg

Freshwater Quality Modeling in Aotearoa New Zealand: Current Practice and Future Directions Unpublished Forthcoming

SSRN, Forthcoming.

2024

Hafner, Jasmin; Lorsbach, Tim; Schmidt, Sebastian; Brydon, Liam; Dost, Katharina; Zhang, Kunyang; Fenner, Kathrin; Wicker, Jörg

Advancements in Biotransformation Pathway Prediction: Enhancements, Datasets, and Novel Functionalities in enviPath Journal Article

In: Journal of Cheminformatics, vol. 16, no. 1, pp. 93, 2024, ISSN: 1758-2946.

Abstract | Links | BibTeX

Lyu, Jiachen; Dost, Katharina; Koh, Yun Sing; Wicker, Jörg

Regional Bias in Monolingual English Language Models Journal Article

In: Machine Learning, 2024, ISSN: 1573-0565.

Abstract | Links | BibTeX

@article{lyu2023regional,

title = {Regional Bias in Monolingual English Language Models},

author = {Jiachen Lyu and Katharina Dost and Yun Sing Koh and J\"{o}rg Wicker},

url = {https://link.springer.com/article/10.1007/s10994-024-06555-6

https://dx.doi.org/10.21203/rs.3.rs-3713494/v1},

doi = {10.1007/s10994-024-06555-6},

issn = {1573-0565},

year  = {2024},

date = {2024-07-09},

urldate = {2024-07-09},

journal = {Machine Learning},

abstract = { In Natural Language Processing (NLP), pre-trained language models (LLMs) are widely employed and refined for various tasks. These models have shown considerable social and geographic biases creating skewed or even unfair representations of certain groups. Research focuses on biases toward L2 (English as a second language) regions but neglects bias within L1 (first language) regions. In this work, we ask if there is regional bias within L1 regions already inherent in pre-trained LLMs and, if so, what the consequences are in terms of downstream model performance. We contribute an investigation framework specifically tailored for low-resource regions, offering a method to identify bias without imposing strict requirements for labeled datasets. Our research reveals subtle geographic variations in the word embeddings of BERT, even in cultures traditionally perceived as similar. These nuanced features, once captured, have the potential to significantly impact downstream tasks. Generally, models exhibit comparable performance on datasets that share similarities, and conversely, performance may diverge when datasets differ in their nuanced features embedded within the language. It is crucial to note that estimating model performance solely based on standard benchmark datasets may not necessarily apply to the datasets with distinct features from the benchmark datasets. Our proposed framework plays a pivotal role in identifying and addressing biases detected in word embeddings, particularly evident in low-resource regions such as New Zealand.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Chang, Xinglong; Brydon, Liam; Wicker, Jörg

Memento: v1.1.1 Miscellaneous

Zenedo, 2024.

Kim, Jonathan; Urschler, Martin; Riddle, Pat; Wicker, Jörg

Attacking the Loop: Adversarial Attacks on Graph-based Loop Closure Detection Proceedings Article

In: Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, pp. 90-97, 2024.

Abstract | Links | BibTeX

@inproceedings{kim2024attacking,

title = {Attacking the Loop: Adversarial Attacks on Graph-based Loop Closure Detection},

author = {Jonathan Kim and Martin Urschler and Pat Riddle and J\"{o}rg Wicker },

url = {http://arxiv.org/abs/2312.06991

https://doi.org/10.48550/arxiv.2312.06991},

doi = {10.5220/0012313100003660},

year  = {2024},

date = {2024-02-27},

urldate = {2024-02-27},

booktitle = {Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications},

volume = {4},

pages = {90-97},

abstract = {With the advancement in robotics, it is becoming increasingly common for large factories and warehouses to incorporate visual SLAM (vSLAM) enabled automated robots that operate closely next to humans. This makes any adversarial attacks on vSLAM components potentially detrimental to humans working alongside them. Loop Closure Detection (LCD) is a crucial component in vSLAM that minimizes the accumulation of drift in mapping, since even a small drift can accumulate into a significant drift over time. Previous work by Kim et al. , unified visual features and semantic objects into a single graph structure for finding loop closure candidates. While this provided a performance improvement over visual feature-based LCD, it also created a single point of vulnerability for potential graph-based adversarial attacks. Unlike previously reported visual-patch based attacks, small graph perturbations are far more challenging to detect, making them a more significant threat. In this paper, we present Adversarial-LCD, a novel black-box evasion attack framework that employs an eigencentrality-based perturbation method and an SVM-RBF surrogate model with a Weisfeiler-Lehman feature extractor for attacking graph-based LCD. Our evaluation shows that the attack performance of Adversarial-LCD was superior to that of other machine learning surrogate algorithms, including SVM-linear, SVM-polynomial, and Bayesian classifier, demonstrating the effectiveness of our attack framework. Furthermore, we show that our eigencentrality-based perturbation method outperforms other algorithms, such as Random-walk and Shortest-path, highlighting the efficiency of Adversarial-LCD’s perturbation selection method.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

2023

Dost, Katharina; Tam, Jason; Lorsbach, Tim; Schmidt, Sebastian; Wicker, Jörg

Defining Applicability Domain in Biodegradation Pathway Prediction Unpublished Forthcoming

Forthcoming.

Abstract | Links | BibTeX

Chang, Xinglong; Dost, Katharina; Dobbie, Gillian; Wicker, Jörg

Poison is Not Traceless: Fully-Agnostic Detection of Poisoning Attacks Unpublished Forthcoming

Forthcoming.

Abstract | Links | BibTeX

Chang, Xinglong; Dobbie, Gillian; Wicker, Jörg

Fast Adversarial Label-Flipping Attack on Tabular Data Unpublished Forthcoming

Forthcoming.

Abstract | Links | BibTeX

Pullar-Strecker, Zac; Chang, Xinglong; Brydon, Liam; Ziogas, Ioannis; Dost, Katharina; Wicker, Jörg

Memento: Facilitating Effortless, Efficient, and Reliable ML Experiments Proceedings Article

In: Morales, Gianmarco De Francisci; Perlich, Claudia; Ruchansky, Natali; Kourtellis, Nicolas; Baralis, Elena; Bonchi, Francesco (Ed.): Machine Learning and Knowledge Discovery in Databases: Applied Data Science and Demo Track, pp. 310-314, Springer Nature Switzerland, Cham, 2023, ISBN: 978-3-031-43430-3.

Abstract | Links | BibTeX

Chang, Luke; Dost, Katharina; Zhai, Kaiqi; Demontis, Ambra; Roli, Fabio; Dobbie, Gillian; Wicker, Jörg

BAARD: Blocking Adversarial Examples by Testing for Applicability, Reliability and Decidability Proceedings Article

In: Kashima, Hisashi; Ide, Tsuyoshi; Peng, Wen-Chih (Ed.): The 27th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), pp. 3-14, Springer Nature Switzerland, Cham, 2023, ISSN: 978-3-031-33374-3.

Abstract | Links | BibTeX

@inproceedings{chang2021baard,

title = {BAARD: Blocking Adversarial Examples by Testing for Applicability, Reliability and Decidability},

author = {Luke Chang and Katharina Dost and Kaiqi Zhai and Ambra Demontis and Fabio Roli and Gillian Dobbie and J\"{o}rg Wicker},

editor = {Hisashi Kashima and Tsuyoshi Ide and Wen-Chih Peng},

url = {https://arxiv.org/abs/2105.00495

https://github.com/wickerlab/baard},

doi = {10.1007/978-3-031-33374-3_1},

issn = {978-3-031-33374-3},

year  = {2023},

date = {2023-05-27},

urldate = {2023-05-27},

booktitle = {The 27th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD)},

journal = {The 27th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD)},

pages = {3-14},

publisher = {Springer Nature Switzerland},

address = {Cham},

abstract = {Adversarial defenses protect machine learning models from adversarial attacks, but are often tailored to one type of model or attack. The lack of information on unknown potential attacks makes detecting adversarial examples challenging. Additionally, attackers do not need to follow the rules made by the defender. To address this problem, we take inspiration from the concept of Applicability Domain in cheminformatics. Cheminformatics models struggle to make accurate predictions because only a limited number of compounds are known and available for training. Applicability Domain defines a domain based on the known compounds and rejects any unknown compound that falls outside the domain. Similarly, adversarial examples start as harmless inputs, but can be manipulated to evade reliable classification by moving outside the domain of the classifier. We are the first to identify the similarity between Applicability Domain and adversarial detection. Instead of focusing on unknown attacks, we focus on what is known, the training data. We propose a simple yet robust triple-stage data-driven framework that checks the input globally and locally, and confirms that they are coherent with the model’s output. This framework can be applied to any classification model and is not limited to specific attacks. We demonstrate these three stages work as one unit, effectively detecting various attacks, even for a white-box scenario.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Chen, Zeyu; Dost, Katharina; Zhu, Xuan; Chang, Xinglong; Dobbie, Gillian; Wicker, Jörg

Targeted Attacks on Time Series Forecasting Proceedings Article

In: Kashima, Hisashi; Ide, Tsuyoshi; Peng, Wen-Chih (Ed.): The 27th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), pp. 314-327, Springer Nature Switzerland, Cham, 2023, ISSN: 978-3-031-33383-5.

Abstract | Links | BibTeX

@inproceedings{Chen2023targeted,

title = {Targeted Attacks on Time Series Forecasting},

author = {Zeyu Chen and Katharina Dost and Xuan Zhu and Xinglong Chang and Gillian Dobbie and J\"{o}rg Wicker},

editor = {Hisashi Kashima and Tsuyoshi Ide and Wen-Chih Peng},

url = {https://github.com/wickerlab/nvita},

doi = {10.1007/978-3-031-33383-5_25},

issn = {978-3-031-33383-5},

year  = {2023},

date = {2023-05-26},

urldate = {2023-05-26},

booktitle = {The 27th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD)},

pages = {314-327},

publisher = {Springer Nature Switzerland},

address = {Cham},

abstract = {Abstract. Time Series Forecasting (TSF) is well established in domains dealing with temporal data to predict future events yielding the basis for strategic decision-making. Previous research indicated that forecasting models are vulnerable to adversarial attacks, that is, maliciously crafted perturbations of the original data with the goal of altering the model’s predictions. However, attackers targeting specific outcomes pose a substantially more severe threat as they could manipulate the model and bend it to their needs. Regardless, there is no systematic approach for targeted adversarial learning in the TSF domain yet. In this paper, we introduce targeted attacks on TSF in a systematic manner. We establish a new experimental design standard regarding attack goals and perturbation control for targeted adversarial learning on TSF. For this purpose, we present a novel indirect sparse black-box evasion attack on TSF, nVita. Additionally, we adapt the popular white-box attacks Fast Gradient Sign Method (FGSM) and Basic Iterative Method (BIM). Our experiments confirm not only that all three methods are effective but also that current state-of-the-art TSF models are indeed susceptible to attacks. These results motivate future research in this area to achieve higher reliability of forecasting models.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Dost, Katharina; Pullar-Strecker, Zac; Brydon, Liam; Zhang, Kunyang; Hafner, Jasmin; Riddle, Pat; Wicker, Jörg

Combatting over-specialization bias in growing chemical databases Journal Article

In: Journal of Cheminformatics, vol. 15, iss. 1, pp. 53, 2023, ISSN: 1758-2946.

Abstract | Links | BibTeX

@article{Dost2023Combatting,

title = {Combatting over-specialization bias in growing chemical databases},

author = {Katharina Dost and Zac Pullar-Strecker and Liam Brydon and Kunyang Zhang and Jasmin Hafner and Pat Riddle and J\"{o}rg Wicker},

url = {https://jcheminf.biomedcentral.com/articles/10.1186/s13321-023-00716-w



},

doi = {10.1186/s13321-023-00716-w},

issn = {1758-2946},

year  = {2023},

date = {2023-05-19},

urldate = {2023-05-19},

journal = {Journal of Cheminformatics},

volume = {15},

issue = {1},

pages = {53},

abstract = {Background



Predicting in advance the behavior of new chemical compounds can support the design process of new products by directing the research toward the most promising candidates and ruling out others. Such predictive models can be data-driven using Machine Learning or based on researchers’ experience and depend on the collection of past results. In either case: models (or researchers) can only make reliable assumptions about compounds that are similar to what they have seen before. Therefore, consequent usage of these predictive models shapes the dataset and causes a continuous specialization shrinking the applicability domain of all trained models on this dataset in the future, and increasingly harming model-based exploration of the space.

Proposed solution



In this paper, we propose cancels (CounterActiNg Compound spEciaLization biaS), a technique that helps to break the dataset specialization spiral. Aiming for a smooth distribution of the compounds in the dataset, we identify areas in the space that fall short and suggest additional experiments that help bridge the gap. Thereby, we generally improve the dataset quality in an entirely unsupervised manner and create awareness of potential flaws in the data. cancels does not aim to cover the entire compound space and hence retains a desirable degree of specialization to a specified research domain.

Results



An extensive set of experiments on the use-case of biodegradation pathway prediction not only reveals that the bias spiral can indeed be observed but also that cancels produces meaningful results. Additionally, we demonstrate that mitigating the observed bias is crucial as it cannot only intervene with the continuous specialization process, but also significantly improves a predictor’s performance while reducing the number of required experiments. Overall, we believe that cancels can support researchers in their experimentation process to not only better understand their data and potential flaws, but also to grow the dataset in a sustainable way. All code is available under github.com/KatDost/Cancels.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Background

Predicting in advance the behavior of new chemical compounds can support the design process of new products by directing the research toward the most promising candidates and ruling out others. Such predictive models can be data-driven using Machine Learning or based on researchers’ experience and depend on the collection of past results. In either case: models (or researchers) can only make reliable assumptions about compounds that are similar to what they have seen before. Therefore, consequent usage of these predictive models shapes the dataset and causes a continuous specialization shrinking the applicability domain of all trained models on this dataset in the future, and increasingly harming model-based exploration of the space.
Proposed solution

In this paper, we propose cancels (CounterActiNg Compound spEciaLization biaS), a technique that helps to break the dataset specialization spiral. Aiming for a smooth distribution of the compounds in the dataset, we identify areas in the space that fall short and suggest additional experiments that help bridge the gap. Thereby, we generally improve the dataset quality in an entirely unsupervised manner and create awareness of potential flaws in the data. cancels does not aim to cover the entire compound space and hence retains a desirable degree of specialization to a specified research domain.
Results

An extensive set of experiments on the use-case of biodegradation pathway prediction not only reveals that the bias spiral can indeed be observed but also that cancels produces meaningful results. Additionally, we demonstrate that mitigating the observed bias is crucial as it cannot only intervene with the continuous specialization process, but also significantly improves a predictor’s performance while reducing the number of required experiments. Overall, we believe that cancels can support researchers in their experimentation process to not only better understand their data and potential flaws, but also to grow the dataset in a sustainable way. All code is available under github.com/KatDost/Cancels.

2022

Dost, Katharina; Duncanson, Hamish; Ziogas, Ioannis; Riddle, Pat; Wicker, Jörg

Divide and Imitate: Multi-Cluster Identification and Mitigation of Selection Bias Proceedings Article

In: 26th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD2022), pp. 149-160, Springer-Verlag, Berlin, Heidelberg, 2022, ISBN: 978-3-031-05935-3.

Abstract | Links | BibTeX

2020

Chester, Andrew; Koh, Yun Sing; Wicker, Jörg; Sun, Quan; Lee, Junjae

Balancing Utility and Fairness against Privacy in Medical Data Proceedings Article

In: IEEE Symposium Series on Computational Intelligence (SSCI), pp. 1226-1233, IEEE, 2020.

Abstract | Links | BibTeX

@inproceedings{chester2020balancing,

title = {Balancing Utility and Fairness against Privacy in Medical Data},

author = {Andrew Chester and Yun Sing Koh and J\"{o}rg Wicker and Quan Sun and Junjae Lee},

url = {https://ieeexplore.ieee.org/abstract/document/9308226},

doi = {10.1109/SSCI47803.2020.9308226},

year  = {2020},

date = {2020-12-01},

booktitle = {IEEE Symposium Series on Computational Intelligence (SSCI)},

pages = {1226-1233},

publisher = {IEEE},

abstract = {There are numerous challenges when designing algorithms that interact with sensitive data, such as, medical or financial records. One of these challenges is privacy. However, there is a tension between privacy, utility (model accuracy), and fairness. While de-identification techniques, such as generalisation and suppression, have been proposed to enable privacy protection, it comes with a cost, specifically to fairness and utility. Recent work on fairness in algorithm design defines fairness as a guarantee of similar outputs for "similar" input data. This notion is discussed in connection to de-identification. This research investigates the trade-off between privacy, fairness, and utility. In contrast, other work investigates the trade-off between privacy and utility of the data or accuracy of the model overall. In this research, we investigate the effects of two standard de-identification techniques, k-anonymity and differential privacy, on both utility and fairness. We propose two measures to calculate the trade-off between privacy-utility and privacy-fairness. Although other research has provided guarantees for privacy regarding utility, this research focuses on the trade-offs given set de-identification levels and relies on guarantees provided by the privacy preservation methods. We discuss the effects of de-identification on data of different characteristics, class imbalance and outcome imbalance. We evaluated this is on synthetic datasets and standard real-world datasets. As a case study, we analysed the Medical Expenditure Panel Survey dataset.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Dost, Katharina; Taskova, Katerina; Riddle, Pat; Wicker, Jörg

Your Best Guess When You Know Nothing: Identification and Mitigation of Selection Bias Proceedings Article

In: 2020 IEEE International Conference on Data Mining (ICDM), pp. 996-1001, IEEE, 2020, ISSN: 2374-8486.

Abstract | Links | BibTeX

2017

Wicker, Jörg; Kramer, Stefan

The Best Privacy Defense is a Good Privacy Offense: Obfuscating a Search Engine User’s Profile Journal Article

In: Data Mining and Knowledge Discovery, vol. 31, no. 5, pp. 1419-1443, 2017, ISSN: 1573-756X.

Abstract | Links | BibTeX