Beyond Peak Performance: Validating Document AI with Intelligent Testing

A new approach frames the validation of Intelligent Document Processing systems as a search-based software testing problem, prioritizing the discovery of diverse risk factors over achieving maximum accuracy.
![The numerical spectrum of quantum states-characterized by spin [latex]SS[/latex] and R-charge [latex]JJ[/latex]-reveals a stringy behavior at strong coupling, where energy levels align with predictions from flat-space string theory and incorporate additional KK-modes for the lowest mass levels, suggesting a graceful decay of the system into a well-defined, albeit evolved, state.](https://arxiv.org/html/2601.21992v1/plot.png)

![Traditional natural language processing attacks demonstrate limited transferability and reduced effectiveness against in-context learning classifiers-as evidenced by consistently lower attack success rates (measured as Attack Success Rate [latex]ASR[/latex] and robust [latex]rASR[/latex]) across varying perturbation budgets-highlighting the need for attack strategies specifically designed for this emerging paradigm.](https://arxiv.org/html/2601.21586v1/x44.png)




