An adaptive combination algorithm based on deep learning and genetic algorithm for anomalous events detection

Question

1. What are the major goals of security systems?

2. What anomaly detection methods are suggested?

3. What is the total number of videos in the UCF-Crime dataset?

4. How does deep learning process unstructured data?

Accepted Answer

The major goals of security systems are the effectiveness and quickness of video anomaly detection. These systems aim to improve public security by utilizing video surveillance systems (VSS) in various locations such as malls, roads, smart cities, hospitals, markets, banks, and educational institutions. The effectiveness of these systems is crucial in detecting and responding to abnormal behavior or events in real-time. To achieve this, several security cameras are installed across the world, generating large volumes of video data. However, the sheer amount of data requires significant human resources for anomalous case detection and real-time video analysis. Moreover, human surveillance of abnormalities is often ineffective due to the potential for human error and loss of focus over time. This has led to the development of autonomous anomaly detection approaches based on artificial intelligence (AI) techniques. AI-based systems can analyze video data more efficiently and accurately, reducing the reliance on human intervention. In the literature, various methods have been proposed to explain anomalous behavior, such as the occurrence of variance in regular patterns. These methods are applied in different contexts, including traffic security, automated intelligent visual monitoring, and crime prevention. Traditionally, video anomaly detection was considered a one-class classification problem, where the classifier is trained on regular videos, and a video is labeled as abnormal when it deviates from the norm. However, advancements in AI and machine learning have expanded the scope of anomaly detection, enabling more sophisticated and accurate identification of abnormal events in video surveillance systems.

Accepted Answer

Several anomaly detection methods are suggested in the provided section. Sultani et al. propose a framework to detect unusual attitudes and inform users. Shreyas et al. recommend reducing video file size before detection. Anala et al., Hao et al., and Dubey et al. address anomaly events as a regression problem. Ullah et al. present a lightweight CNN. Zaheer et al. propose a weakly supervised model based on video-level labels. Majhi et al. offer a weakly supervised learning model for anomaly detection. Wu et al. introduce a dual-branch network with multi-detail concepts. Cao et al. consider spatial-temporal relationships for anomaly detection. Abbas and Al-Ani suggest video compression and feature map reduction using H265 and principal component analysis. Abbas and Al-Ani also propose using a genetic algorithm for features selection. The BiLSTM model is used for classifying anomalies based on spatio-temporal features.

Accepted Answer

The UCF-Crime dataset contains a total of 1,900 videos. These videos are divided into 800 normal videos and 810 anomaly videos for training, while the testing phase includes 150 normal videos and 140 anomaly videos. The dataset comprises over 129 hours of films at a resolution of 320x240 and 13 million frames. The dataset was selected due to its diverse range of abnormal event categories and the significant impact of its abnormalities on community security. For the research experimentation, videos with lengths less than or equal to two minutes were chosen, resulting in 1,324 videos being used. These videos were divided into 1,116 for the training stage (in a ratio of 90:10 for training and validation respectively) and 208 for the testing stage.

Accepted Answer

Deep learning (DL) processes unstructured data by gradually recognizing and comprehending its various facets. DL, a subset of machine learning (ML), separates input into layers, with each level extracting features and transmitting them to the layer above. The first layers collect fundamental data, which is coupled with explanations offered by the next layers. As the amount of information increases, the effectiveness of DL classifiers greatly improves compared to standard learning models. DL utilizes various designs such as recurrent neural networks (RNN), pre-trained networks, CNN, and others for different applications. CNN, for example, is commonly used in image processing and requires less setup than other categorization techniques. It uses appropriate filters to discover spatial and temporal relationships from an image. RNN, on the other hand, is better at understanding sequence information than CNN, as it employs state variables to store historical information and combine it with present input to forecast present outcomes. An example of an RNN is the Long Short-Term Memory (LSTM) network. Overall, DL's ability to process unstructured data has significantly expanded with the availability of data and powerful computers.

Accepted Answer

GA reduces feature numbers by optimizing the feature map. In the research, GA is employed to compute a new features map, which is input into the binary classifier instead of the retrieved features from ResNet50. This process results in a variation in the number of feature vectors between the dataset before and after applying GA for both the training and testing dataset. The red curve in Figure 3 indicates the number of feature vectors for the video before applying GA, while the blue curve indicates the number of feature vectors after applying GA. This reduction in feature numbers helps in improving the efficiency and performance of the model.

Accepted Answer

In this research, the features obtained from the feature selection stage were utilized as input to the BiLSTM classifier model for detecting anomaly events in videos. This approach differs from previous works that used features extracted during the feature extraction stage. By using features from the selection stage, the research aimed to improve the classifier's performance in identifying anomalies in video sequences. The selected features were crucial in training the BiLSTM model, which was employed as a classifier to detect anomalies effectively. The model's variables were chosen through trial and error, and the effectiveness of the suggested model was assessed using the receiver operating characteristics (ROC) and the area under the curve (AUC) metrics. The results showed that the proposed model with GA (Genetic Algorithm) achieved a higher AUC score of 94.58% compared to the model without GA, which had an AUC score of 92.47%. Additionally, the model trained on features subject to GA took approximately half the time to train compared to the model trained on features not subject to GA. Overall, the use of features from the feature selection stage as input to the BiLSTM classifier proved to be effective in detecting anomalies in video sequences.

Accepted Answer

In this research, a pre-trained ResNet50 model was employed for extracting the features. The features were taken from the fc1,000 layer, resulting in 1,000 aspects per frame. For a movie with x frames, the number of features will be 1,000 * x, which is considered huge data. To improve classification accuracy and save training time, the retrieved features were passed to the GA model before classifying them for the UCF-Crime dataset.

Accepted Answer

The proposed system merges ML with DL, using Resnet50 for feature extraction and GA for feature map generation. This approach has shown greater accuracy compared to earlier works, with an AUC value improvement of up to 94.58% on the UCF-Crime dataset. The model achieved 89.90% detection accuracy, highlighting the importance of minimizing false alarms. Future works will explore other dimensionality reduction, feature selection, and extraction methods to further enhance accuracy.

An adaptive combination algorithm based on deep learning and genetic algorithm for anomalous events detection

Chat with Paper

AI Agents for this Paper

Most frequently asked questions

1. What are the major goals of security systems?

2. What anomaly detection methods are suggested?

3. What is the total number of videos in the UCF-Crime dataset?

4. How does deep learning process unstructured data?

5. How does GA reduce feature numbers?

6. What features were used as input for the BiLSTM classifier?

7. What model was used for feature extraction in this research?

8. How does the proposed system improve anomaly detection accuracy?

Citations

Enhancing brain tumor detection: integrating CNN-LSTM and CNN-BiLSTM models for efficient classification in MRI images

References

Real-World Anomaly Detection in Surveillance Videos

Introduction to Genetic Algorithms for Scientists and Engineers

Dive Into Deep Learning

Choosing Mutation and Crossover Ratios for Genetic Algorithms—A Review with a New Dynamic Approach

Learning Regularity in Skeleton Trajectories for Anomaly Detection in Videos

Related Papers (5)

Physician-Friendly Machine Learning: A Case Study with Cardiovascular Disease Risk Prediction

Unsupervised deep learning-based process monitoring methods

Breakdown of Machine Learning Algorithms

A genetic algorithm for reliability-oriented task assignment in a distributed system

Language Identification as Process Prediction Using WoMan