Implementing generative pretrained transformer models for text recognition tasks in safety data sheets
article
Workplaces handling chemicals require an up-to-date and comprehensive assessment of the potential risks for their workforce. Online safety data sheets (SDSs) inventories provide adequate information to perform risk assessments. However, current practices that manually import information from SDSs into the online inventories are time-consuming, leading to delayed or inadequate risk assessments. This study presents a pipeline using large language models (LLMs) to automate the extraction and management of data from SDSs to online chemical inventories. The pipeline achieved an average accuracy of 0.83 in (close to precisely) extracting multiple variables of interest, such as company name, product name, and hazard statements, in comparison to manually extracting these variables. Overall, this pipeline illustrates the ability of LLM tools to automate SDS inventory management and thereby support the possibility to perform up-to-date risk assessments and evaluation tasks on the work floor, ultimately contributing to occupational safety.
TNO Identifier
1020284
Source
Anuals Work Exposures and Health, pp. Epub 24 Nov.
Pages
Epub 24 Nov