SITIS Topic Details |
||||||
| Proposals Accepted: | |
| Program: | SBIR |
| Topic Number: | A10-172 (Army) |
| Title: | Obstacle Detection and Awareness via High-Resolution Monocular Video | Research & Technical Areas: | Sensors, Electronics |
| Acquisition Program: | Objective: | This SBIR aims to build an electro-optic mobility aid for indirect driving and situational awareness applications that allows Soldiers to detect and analyze distant obstacles from a moving manned or unmanned ground vehicle with high-resolution monocular video.
| Description: | The United States Army increasingly aims to integrate advanced sensor technologies onto its manned and unmanned ground vehicles to advance Soldiers’ situational awareness and indirect driving capabilities in the field. In this regard, monocular video is a very critical sensing technology because it is fairly inexpensive and requires relatively little power. In addition, monocular camera technologies are reliable; and, such technologies emit almost no electromagnetic signature. In recent years, monocular cameras have offered increased spatial resolution; however, the application of these high-resolution imaging technologies upon military ground vehicles is hindered by the limited resolution of their display panels and bandwidth of their content delivery systems. Often, engineers alleviate this issue by down-sampling the original monocular signal to match the system’s display resolution or data transfer capability; however, with this approach, Soldiers cannot view a local Region of Interest (ROI) in the highest captured resolution. Soldiers in the field increasingly use high-resolution monocular video to operate their systems sometimes, beneath a closed hatch. These vehicle operators have no access to windows that could improve situational awareness; and, they increasingly require capabilities to understand the threat of potential obstacles from a large distance. The Army therefore desires the capability to use existing high-resolution monocular cameras to their fullest potential to detect and understand the threat of obstacles while driving. Virtual Pan-Tilt-Zoom (PTZ) methods may be used to partially address this issue. Virtual PTZ algorithms allow Soldiers to pan, tilt, and zoom through a high-resolution video feed in real-time without mechanical components. These algorithms thereby allow Soldiers to view potential obstacles within some local ROI from a large distance in the highest captured resolution. Unfortunately, no known technologies exist that meld Virtual PTZ methods with those that detect and understand obstacles from a vehicle moving at speeds of up to sixty miles per hour. These algorithms would benefit ongoing Army operations. For instance, current capabilities to detect and optically zoom upon an obstacle of interest are limited by the number of zoom-capable cameras upon the vehicle. A versatile Virtual PTZ framework would permit one high-resolution monocular camera to simultaneously detect, track, and zoom upon several obstacles of interest from a large distance. A robust Virtual PTZ framework must allow random access to any local ROI with little latency. Obstacles may be dynamic or stationary. The obstacle detection and tracking system must be robust to physical occlusion and environmental conditions for instance, varying light levels, shadows, and precipitation. The tracking system must account for orientation shifts to obstacles as the vehicle moves; and, the system must at a minimum offer a method to measure an obstacle threat to the vehicle and alert its operator as appropriate. The solution must provide all of this information and capability to the Soldier in an intuitive manner.
| PHASE I: Design the Obstacle Detection and Awareness System with a High-Resolution Monocular Camera. Provide a report that describes the intended implementation. At a minimum, it must explain the algorithm, provide a user interface concept, describe significant design trade-offs, estimate an implementation schedule, and include a risk mitigation matrix. In addition, applicants must conduct proof-of-principle experiments to support their concept and provide evidence of its viability.
| PHASE II: Completely develop the Obstacle Detection and Awareness System for a High-Resolution Monocular Camera. A prototype shall be integrated onto a commercial vehicle or robot; however, care must be taken to ensure its eventual integration on a manned or unmanned military ground vehicle. It shall be demonstrated in an outdoor urban environment of the Government’s choice. Reports shall be delivered that document Project-related activities, the system’s technical specifications, and a User’s Guide.
| PHASE III: The Obstacle Detection and Awareness System described herein may be integrated onto a fleet of manned and unmanned military ground vehicles to improve operators’ situational awareness and indirect driving capabilities in combat. These capabilities are particularly important for urban operations wherein obstacles are dispersed throughout cluttered and dynamic environments; but, they may also be employed in off-road mountainous terrains wherein obstacles are hidden throughout natural landscapes. Virtual PTZ algorithms may be employed in various commercial applications, such as interactive television, wide-area surveillance, and interactive streaming media. Combined with the obstacle detection and awareness technologies detailed herein, the complete Obstacle Detection and Awareness System may be used in the luxury automobile and emergency response i.e., police, fire, SWAT, etc. markets to improve operators’ awareness of obstacles while driving. The IMOPAT ATO strongly supports this effort; and, if successful, it aims to provide non-SBIR funding after Phase II to integrate the technology onto future military vehicle platforms.
| References: | 1. Mavlankar, A., Baccichet, P., Varodayan, D., and Girod, B., Optimal Slice Size for Streaming Regions of High Resolution Video with Virtual Pan/Tilt/Zoom Functionality, Proc. of 15th European Signal Processing Conference (EUSIPCO), Poznan, Poland, Sept. 2007. 2. Sinn, R., Virtual Pan-Tilt-Zoom for a Wide-Area-Video Surveillance System Master’s Thesis, Massachusetts Institute of Technology, September 2008. 3. Ulrich, I., and Nourbakhsh, I., Appearance-Based Obstacle Detection with Monocular Color Vision, Proc. of AAAI 2000. 4. Regensburger, U., and Graefe, V., Visual Recognition of Obstacles on Roads, Proc. IEEE/RSJ International Conference on Intelligent Robots and Systems, 1994. |
| Keywords: | Obstacle Detection, Indirect Driving, Situational Awareness, Tracking, Pan, Tilt, Zoom, Threat Evaluation |
Questions and Answers: |
Q: 1. What kinds of obstacles are of interest? For example, specific types of potential threats such as humans, trucks, etc. or more generally anything that might pose a collision threat, including fallen tree branches, rocks, etc. |
A: Here are the responses to your questions: |
Q: What is the definition of an "obstacle"? Do obstacles have any pattern that can be detected or it could be anything like a person, tree, vehicle, building, etc. Also, do you have any sample video from the monocular camera? |
A: 1) From our perspective, an obstacle is anything that can impede a |
Q: 1. What are the maximum slew rates at which a camera may be virtually Panned, Tilted or Zoomed by an operator? |
A: 1) I don't have a maximum slew rate in mind; but, it should be set to ensure effective usability from the Warfighter. It shouldn't be any less than 70 degrees for pan and 30 degrees for tilt. |
Q: Will there be multiple users, that is, multiple simultaneous local ROIs? |
A: I would target this solution for the operator of the vehicle. That said, I think a framework to transmit useful information from the system to the vehicle commander and rear occupants would be useful. |
Q: Is the report which describes the intended implementation geared towards the actual development of the system in phase? |
A: I don't exactly understand the question - but, the report at the end of Phase I should describe the intended implementation in Phase II. |
Q: What is expected in the risk mitigation matrix? |
A: Essentially, I want to know about the potential difficulties or vulnerabilities of the proposed system ahead of time. i.e., have certain portions of your solution never been done before?, what is the likelihood of failure for critical portions of the system based on a reasonable assessment of the solution?, what are the probable impacts of failure?, etc. |
Q: Is this solicitation permitted to have a phase 1 option? |
A: Yes - a Phase I Option is permitted for this solicitation. Definitely include a Phase I Option in your proposal. |
Q: What can be considered as viable work to be included in the phase 1 option of the solicitation? |
A: The Phase I Option is meant to provide a bridge between Phase I and Phase II. It is worth $50K; and, it would last approximately 4 months. It can include any work that moves the project forward during the contract negotiation process. |
Q: Are you expecting to see any high level plans for the phase 2 portion of this solicitation? |
A: The only documentation that I expect from Phase II is a report that details the design and implementation of the system, a report that provides the technical specifications of the system, and a User's Guide to operate the system. |
Q: Is there a need to provide full motion video (e.g., 30 fps) for all tracked targets in their respective virtual zoom windows, or would it be acceptable if tracked targets are updated at a slower rate (e.g., 3 fps)? This question seekss to understand whether fluid video motion is more important or whether an ability to track a larger number of simultaneous targets is more important. |
A: I think there needs to be a balance between these issues. For instance, I don't think the number of targets to be tracked should be completely unbounded - first, this would increase the possibility of false alarms; and also, too large a number of tracked and displayed objects could render the system unusable. On the other hand, we expect the vehicle to move at speeds of up to sixty miles per hour; and as such, a 3 fps tracking system might be too slow - especially for objects near the vehicle. |
As of midnight September 1, questions for solicitations SBIR 10.3 and STTR 10.B will no longer be accepted.
To read the solicitation for full proposal preparation and submission details click here. |