• Train




          Data Collection

          Building Blocks​

          Device Enrollment

          Monitoring Dashboards

          Video Annotation​

          Application Editor​

          Device Management

          Remote Maintenance

          Model Training

          Application Library

          Deployment Manager

          Unified Security Center

          AI Model Library

          Configuration Manager

          IoT Edge Gateway

          Privacy-preserving AI

          Ready to get started?

          Expert Services
  • Why Viso Suite
  • Pricing
Close this search box.

Fall Detection: Vision Deep Learning Application


Viso Suite is the all-in-one solution for teams to build, deliver, scale computer vision applications.

Need Computer Vision?

Viso Suite is the world’s only end-to-end computer vision platform. Request a demo.

Human falls are considered to be an important health problem worldwide. Among all potential technologies able to detect a fall, artificial vision techniques have proven extremely effective over the last years. Human Fall detection systems aim to reduce both dependency and care costs in the elderly community.

Why Is Fall Detection Important?

Fall is one of the major causes of death for older people. Over 30% of falls cause severe injuries, ranging from hip fracture to brain concussion, and a high number of them end up causing death.

Hence, human fall detection systems are increasingly important in today’s aging population and will therefore become even more important in the near future. According to UN reports on the aging population, the global population aged over 60 doubled its number in 2017 compared to 1980. It is expected to further double again by 2050 when the number of the elderly is expected to exceed the number of youngsters aged 10 to 24.

With that background scenario, automatic fall detection systems are an area of growing interest over the course of the next few years, as they could have a high impact on life quality for the dependent community.

Systems For Human Fall Detection

There are different approaches to design a fall detection system. Such systems depend on the types of devices selected for data acquisition, the location in which these devices are placed, and how fall detection is applied. Fall detection systems, in broad terms, can be classified as wearable, ambient, and vision-based ones:

  1. Wearable systems: Such systems use sensors carried by the elderly person. The technologies by this group of systems are numerous, including accelerometers, pressure sensors, inclinometers, gyroscopes, microphones, or other sensors.
  2. Placed sensors: Other systems include sensors that are placed around the monitored person and include pressure, infrared, acoustic, or radio-frequency sensors.
  3. Artificial vision: Over the past years, the use of artificial intelligence with vision has experienced rapid development, mainly due to the use of artificial neural networks and their ability to visually recognize and detect objects, events, and human actions.

Vision-based Human Fall Detection Systems

Real-time vision-based fall detection applications support elderly people through analyzing the rate of change of motion with respect to the ground point.

How it Works

Vision-based fall detection systems examine human pose, human movement, or a combination of both and categorize fall-events in case the established criteria are met.

AI vision-based fall detection systems have a common structure of two steps, the first one assigns abstract descriptors to input video feeds, and the second one classifies them. Some systems include an additional step to improve the quality of the incoming signal by reducing noise or adapting its format to the needs of the blocks downstream of it (Preprocessing).

Human Fall Detection
Kalman filter predictions for a subject position with Step 1: walking (a); Step 2: fall initiated (b); Step 3: fall terminated (c). – Source
Main Advantages

Vision-based systems are able to provide an efficient method to detect falls, without wearing any physical devices and the need to directly impact the person. Also, vision-based systems do not need to be applied correctly or require human interaction by the subject (e.g. wearing gadgets or any form of interaction).

Some advantages of vision-based systems are that they can run on different computing architectures and that there are many algorithms and libraries implemented open-source. A variety of algorithms have been developed for fall detection, some of which are designed to analyze static images or treat each frame individually, others are able to identify the occurrence of fall events over multiple frames.

Main Challenges

Vision-based devices have the same drawback as ambient devices: they have to be installed in multiple rooms to cover a larger area of interest. Another challenge is the treatment of privacy because images from a real person’s life are involved. Privacy-preserving Deep Learning

A solution is a system that processes all visuals on the edge (Edge Computing), does not send any images at all but sends an alert when the fall has been detected. Another option is sending images only when the fall has been detected (Edge AI with data offloading), those images can be blurred easily to avoid facial recognition from third parties (Face Blur). Recently, several approaches for privacy-preserving deep learning have been introduced.

Characteristics of Vision-based Fall Detection Applications

Vision-based Method

Most methods based on video analysis make use of a 2D or 3D model, others are based on feature extraction after the video image segmentation of the body. The approaches can be classified into the following categories: body and shape change, posture detection, inactivity, Spatio-temporal, and 3D head change.


In vision-based systems, cameras are one of the most important parts. The vision-based approaches are focused on the real-time execution of the algorithm using standard computing platforms and low-cost cameras. Two types of cameras are mainly used for fall detection: Regular 2D cameras and 3D cameras (ToF). The lateral resolution of time-of-flight (ToF) cameras is generally low compared to standard 2D video cameras and such specialized cameras are much more expensive.

Processing Units

Most vision-based fall detection systems depend on high computational processing power or expensive hardware. For example, a system based on foreground extraction, feature extraction, and intelligent fall detection was built using an Intel Core i5 processor at 2.6 GHz. Another approach with fall detection by detecting falls from a single camera with arbitrary view angles was set up with an Intel Core i7-2600 3.40 GHz processor, and 16 GB of RAM.

What’s next?

Read more to learn about other vision-based deep learning applications.

Follow us

Related Articles

Join 6,300+ Fellow
AI Enthusiasts

Get expert news and updates straight to your inbox. Subscribe to the Viso Blog.

Sign up to receive news and other stories from viso.ai. Your information will be used in accordance with viso.ai's privacy policy. You may opt out at any time.
Play Video
Would you like a demo?

See how your team can build your real-world AI vision systems faster with our end-to-end solution.

Join 6,300+ Fellow
AI Enthusiasts

Get expert AI news 2x a month. Subscribe to the most read Computer Vision Blog.

You can unsubscribe anytime. See our privacy policy.