Transcriptomics-based biomarkers are promising new approach methodologies (NAMs) to identify molecular events underlying the genotoxic mode of action of chemicals. Previously, we developed the GENOMARK biomarker, consisting of 84 genes selected based on whole genomics DNA microarray profiles of 24 (non-)genotoxic reference chemicals covering different modes of action in metabolically competent human HepaRG™ cells. In the present study, new prediction models for genotoxicity were developed based on an extended reference dataset of 38 chemicals including existing as well as newly generated gene expression data. Both unsupervised and supervised machine learning algorithms were used, but as unsupervised machine learning did not clearly distinguish both groups, the performance of two supervised machine learning algorithms, i.e., support vector machine (SVM) and random forest (RF), was evaluated. More specifically, the predictive accuracy was compared, the sensitivity to outliers for one or more biomarker genes was assessed, and the prediction performance for 10 misleading positive chemicals exposed at their IC10 concentration was determined. In addition, the applicability of both prediction models on a publicly available gene expression dataset, generated with RNA-sequencing, was investigated. Overall, the RF and SVM models were complementary in their classification of chemicals for genotoxicity. To facilitate data analysis, an online application was developed, combining the outcomes of both prediction models. Furthermore, this research demonstrates that the combination of gene expression data with supervised machine learning algorithms can contribute to the ongoing paradigm shift towards a more human-relevant in vitro genotoxicity testing strategy without the use of experimental animals.