limitedDistribution · Industry Research

Prompt-to-policy: Leveraging large language models to guide deep reinforcement learning in public health emergencies

The article explores a hybrid agent using LLMs and RAG for dynamic decision-making, validated on COVID-19 data, enhancing efficiency and robustness.

X. Wang, D. Duma, M.R. Dihan, P. Chowdhury, L. Thul, J. Liu, W. Xiao, J. Cui, L. Yu, J. Lee, Z. Du, D. Š.emrov, T. Sun, M.D. Stosic, V. Chaudhary, R.M. Anderson, S.L. Wang, K. Huang, J. Zhang, E. Keyvanshokooh, Y. Zhang, Y. Wang, T. Ning, D. Hachiya, J. Fan, Q. Liu, H. Li, H. ZhouFebruary 20, 20261 min read

Prompt-to-policy: Leveraging large language models to guide deep reinforcement learning in public health emergencies

Stargo's RAG-powered insights enhance decision-making by processing unstructured data, similar to the hybrid agent's efficiency in public health emergencies.

Executive Summary

The article discusses a hybrid agent that integrates D2QN-JDA with large language models (LLMs) for dynamic decision-making in public health emergencies. LLMs with retrieval-augmented generation (RAG) process multi-source heterogeneous data to calculate adaptive rewards. The D2QN-JDA method is validated on real-world COVID-19 data from Hong Kong, demonstrating superior cost efficiency and robustness compared to other algorithms. The approach automates the processing of heterogeneous, real-time data, reducing manual intervention and enhancing system robustness. LLMs outperform manual and regex methods in data extraction, improving accuracy, recall, F1 score, cost, and time.

Source: www.sciencedirect.com

Authors: X. Wang, D. Duma, M.R. Dihan, P. Chowdhury, L. Thul, J. Liu, W. Xiao, J. Cui, L. Yu, J. Lee, Z. Du, D. Š.emrov, T. Sun, M.D. Stosic, V. Chaudhary, R.M. Anderson, S.L. Wang, K. Huang, J. Zhang, E. Keyvanshokooh, Y. Zhang, Y. Wang, T. Ning, D. Hachiya, J. Fan, Q. Liu, H. Li, H. Zhou

Original Article: https://www.sciencedirect.com/science/article/abs/pii/S016926072600057X

More from the News Room

View all

We are publishing more related coverage here soon. Explore the full News Room for the latest articles.

See ROI in 12 weeks

See where enterprise data is slowing operations down.

Estimate the manual effort, delays, and leakage hidden across your current workflow before you automate it.

Get a Data Savings Estimate Book a Demo