This chapter explores the transformative impact of deep learning on real-time video understanding for public surveillance and safety analytics. We delve into the foundational concepts, advanced techniques, and practical applications of deep learning models in analyzing vast streams of video data from surveillance cameras. The chapter provides a comprehensive overview of state-of-the-art methodologies, including object detection, tracking, and anomaly detection, which are critical for enhancing public safety. We propose a hybrid deep learning framework that integrates Convolutional Neural Networks (CNNs) for spatial feature extraction and Long Short- Term Memory (LSTM) networks for temporal analysis, enabling robust and efficient real-time video understanding. The performance of the proposed methodology is evaluated on a public dataset, demonstrating its effectiveness in identifying and classifying various activities and events in surveillance footage. The chapter concludes with a discussion of the results, challenges, and future directions in this rapidly evolving field.