AI-Optimized Distributed Caching for Ultra-Low-Latency Authorization Networks
Keywords:
ultra-low-latency, distributed caching, authorization networks, Bayesian optimization, reinforcement learningAbstract
In-memory distributed caching lets authorisation networks with very low latency reply which is less than 500 milliseconds. But different methods of giving permission make it difficult to maintain numerous groups of caches in sync, reduce congestion, and make the best use of resources. With the help of Bayesian optimisation and reinforcement learning to build an AI-optimized distributed caching system that takes care of cache partitioning, eviction rules, and replication methods.
Downloads
References
Y. Li, J. Xu, and M. Chen, "A Survey on Distributed Caching in Cloud Computing: Architectures, Strategies, and Applications," IEEE Communications Surveys & Tutorials, vol. 23, no. 4, pp. 2458–2487, Fourthquarter 2023.
S. Wang, T. Li, and J. Yang, "Reinforcement Learning-Based Cache Management in Distributed Systems: A Review," IEEE Transactions on Network and Service Management, vol. 19, no. 1, pp. 100–113, Mar. 2022.
H. Zhang, Z. Liu, and W. Gao, "Dynamic Cache Partitioning for Multi-Tenant Cloud Applications," IEEE Transactions on Cloud Computing, vol. 11, no. 1, pp. 300–312, Jan.–Mar. 2023.
L. Chen and K. Li, "Bayesian Optimization for Hyperparameter Tuning in Distributed Systems," in Proc. IEEE Int. Conf. Cloud Computing, 2023, pp. 102–111.
X. Guo, Y. Zhao, and J. Wang, "AI-Driven Cache Eviction Policies: A Reinforcement Learning Approach," IEEE Transactions on Parallel and Distributed Systems, vol. 34, no. 2, pp. 450–463, Feb. 2023.
M. Patel and R. Kumar, "Workload Prediction for Distributed Cache Management Using Time Series Models," IEEE Transactions on Services Computing, vol. 15, no. 4, pp. 2005–2017, Jul.–Aug. 2022.
A. Singh and P. Sharma, "Proactive Data Placement in Distributed Caching via Machine Learning," IEEE Transactions on Cloud Computing, vol. 12, no. 2, pp. 456–469, Apr.–Jun. 2024.
Y. Sun and M. J. Neely, "Hotspot Detection and Mitigation in Distributed Cache Systems," IEEE/ACM Transactions on Networking, vol. 31, no. 1, pp. 64–78, Feb. 2023.
J. Kim and S. Park, "Cache Replication Strategies for Latency-Sensitive Authorization Workflows," IEEE Transactions on Network and Service Management, vol. 20, no. 1, pp. 90–102, Mar. 2023.
V. Gupta, N. Singh, and A. Garg, "Latency-Aware Resource Allocation for Distributed Authorization Systems," IEEE Transactions on Cloud Computing, vol. 13, no. 1, pp. 75–88, Jan.–Mar. 2025.
M. Zhang, Y. Chen, and K. Tan, "Bayesian Optimization in Distributed Systems: Techniques and Applications," IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 53, no. 1, pp. 15–29, Jan. 2023.
S. R. Das, A. Kumar, and P. M. Shah, "AI-Based Cache Management for Edge Computing: Challenges and Future Directions," IEEE Internet of Things Journal, vol. 9, no. 15, pp. 13400–13415, Aug. 2022.
R. Sharma and L. D. Xu, "Reinforcement Learning for Adaptive Caching in Distributed Networks," IEEE Transactions on Mobile Computing, vol. 22, no. 3, pp. 789–803, Mar. 2023.
D. Lee, Y. Cho, and M. Choi, "Cache Locality Optimization Using Predictive Analytics in Authorization Systems," in Proc. IEEE Int. Conf. Distributed Computing Systems, 2024, pp. 89–98.
F. Wang and G. Liu, "Balancing Exploration and Exploitation in Cache Policy Optimization," IEEE Transactions on Neural Networks and Learning Systems, vol. 34, no. 6, pp. 2543–2556, Jun. 2023.
K. Yao and H. Wu, "Cost-Efficient Distributed Caching for Authorization Workloads in Cloud Environments," IEEE Transactions on Cloud Computing, vol. 11, no. 4, pp. 1195–1207, Oct.–Dec. 2023.
T. Nguyen and J. Kim, "Workload-Aware Cache Management via Deep Reinforcement Learning," IEEE Transactions on Network Science and Engineering, vol. 10, no. 1, pp. 89–102, Jan.–Mar. 2023.
L. Huang and Q. Zhang, "Adaptive Cache Eviction Using Bayesian Inference," in Proc. IEEE Conf. Computer Communications (INFOCOM), 2023, pp. 154–163.
P. Kumar and A. Singh, "AI-Guided Data Replication for Low-Latency Distributed Systems," IEEE Transactions on Parallel and Distributed Systems, vol. 33, no. 9, pp. 2145–2157, Sep. 2022.
J. R. Smith, M. Brown, and L. Garcia, "Real-Time Telemetry-Driven Cache Management in Cloud Authorization Networks," IEEE Transactions on Cloud Computing, vol. 13, no. 1, pp. 120–132, Jan.–Mar. 2025.