limitedDistribution · Industry Research

Prompt-to-policy: Leveraging large language models to guide deep reinforcement learning in public health emergencies

The study explores a hybrid agent using LLMs and RAG for dynamic decision-making in public health, enhancing data processing and system robustness.

X. Wang, D. Duma, M.R. Dihan, P. Chowdhury, L. Thul, J. Liu, W. Xiao, J. Cui, L. Yu, J. Lee, Z. Du, D. Š.emrov, T. Sun, M.D. Stosic, V. Chaudhary, R.M. Anderson, S.L. Wang, K. Huang, J. Zhang, E. Keyvanshokooh, Y. Zhang, Y. Wang, T. Ning, D. Hachiya, J. Fan, Q. Liu, H. Li, H. ZhouFebruary 22, 20261 min read

Prompt-to-policy: Leveraging large language models to guide deep reinforcement learning in public health emergencies

Stargo's RAG-powered insights enhance decision-making by processing heterogeneous data, aligning with the article's focus on LLMs in public health.

Executive Summary

The article discusses a hybrid agent that integrates D2QN-JDA with large language models (LLMs) for dynamic decision-making in public health emergencies. LLMs with retrieval-augmented generation (RAG) process multi-source heterogeneous data to calculate adaptive rewards. The agent is validated with pandemic data from Hong Kong, outperforming baselines in cost efficiency and robustness. The study highlights the use of LLMs to automate the processing of heterogeneous, real-time data, reducing manual intervention and enhancing system robustness. The proposed method demonstrates superior performance in accuracy, recall, F1 score, cost, and time compared to traditional methods.

Source: www.sciencedirect.com

Authors: X. Wang, D. Duma, M.R. Dihan, P. Chowdhury, L. Thul, J. Liu, W. Xiao, J. Cui, L. Yu, J. Lee, Z. Du, D. Š.emrov, T. Sun, M.D. Stosic, V. Chaudhary, R.M. Anderson, S.L. Wang, K. Huang, J. Zhang, E. Keyvanshokooh, Y. Zhang, Y. Wang, T. Ning, D. Hachiya, J. Fan, Q. Liu, H. Li, H. Zhou

Original Article: https://www.sciencedirect.com/science/article/abs/pii/S016926072600057X

More from the News Room

View all

We are publishing more related coverage here soon. Explore the full News Room for the latest articles.

See ROI in 12 weeks

See where enterprise data is slowing operations down.

Estimate the manual effort, delays, and leakage hidden across your current workflow before you automate it.

Get a Data Savings Estimate Book a Demo