7
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Spatiotemporal Modeling for Crowd Counting in Videos

      Preprint
      , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Region of Interest (ROI) crowd counting can be formulated as a regression problem of learning a mapping from an image or a video frame to a crowd density map. Recently, convolutional neural network (CNN) models have achieved promising results for crowd counting. However, even when dealing with video data, CNN-based methods still consider each video frame independently, ignoring the strong temporal correlation between neighboring frames. To exploit the otherwise very useful temporal information in video sequences, we propose a variant of a recent deep learning model called convolutional LSTM (ConvLSTM) for crowd counting. Unlike the previous CNN-based methods, our method fully captures both spatial and temporal dependencies. Furthermore, we extend the ConvLSTM model to a bidirectional ConvLSTM model which can access long-range information in both directions. Extensive experiments using four publicly available datasets demonstrate the reliability of our approach and the effectiveness of incorporating temporal information to boost the accuracy of crowd counting. In addition, we also conduct some transfer learning experiments to show that once our model is trained on one dataset, its learning experience can be transferred easily to a new dataset which consists of only very few video frames for model adaptation.

          Related collections

          Most cited references6

          • Record: found
          • Abstract: not found
          • Book Chapter: not found

          Learning to Count with CNN Boosting

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Detecting Humans in Dense Crowds Using Locally-Consistent Scale Prior and Global Occlusion Reasoning

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Data-Driven Crowd Understanding: A Baseline for a Large-Scale Crowd Dataset

                Bookmark

                Author and article information

                Journal
                25 July 2017
                Article
                1707.07890
                ba21c555-af28-4c1a-b00c-fb6d303a5287

                http://creativecommons.org/licenses/by/4.0/

                History
                Custom metadata
                Accepted by ICCV 2017
                cs.CV

                Comments

                Comment on this article