Summary: “Object and Action Recognition Assisted by Computational Linguistics”.
The aim of this project is to investigate how computer vision methods such as object and
action recognition may be assisted by computational linguistic models, such as WordNet.
The main challenge of object and action recognition is the scalability of methods from
dealing with a dozen of categories (e.g. PASCAL VOC) to thousands of concepts (e.g.
ImageNet ILSVRC). This project is expected to contribute to the application of automated
visual content annotation and more widely to bridging the semantic gap between
computational approaches of vision and language.
Summary: “Object and Action Recognition Assisted by Computational Linguistics”.
The aim of this project is to investigate how computer vision methods such as object and
action recognition may be assisted by computational linguistic models, such as WordNet.
The main challenge of object and action recognition is the scalability of methods from
dealing with a dozen of categories (e.g. PASCAL VOC) to thousands of concepts (e.g.
ImageNet ILSVRC). This project is expected to contribute to the application of automated
visual content annotation and more widely to bridging the semantic gap between
computational approaches of vision and language.
Amr organised the Annual Showcase Event for the School of Computer Science, University of Lincoln. (14th and 15th May). This year, the event also featured the Postgraduates by Research (PGRs) presenting their research work and demonstrating some interactive demos.
Members of DCAPI group have presented and showed their research work in the Annual Showcase Event for the School of Computer Science, University of Lincoln. (14th and 15th May). Saddam also won the “Best Demo” prize for his video matching & retrieval interactive demo. (more details and photos are available on the group’s website at http://dcapi.blogs.lincoln.ac.uk/2014/05/17/pgrs-showcase-event/)
The paper “Compact Signature-based Compressed Video Matching Using Dominant Colour Profiles (DCP)” has been accepted in the ICPR 2014 conference http://www.icpr2014.org/, and will be presented in August 2014, Stockholm, Sweden.
Abstract— This paper presents a technique for efficient and generic matching of compressed video shots, through compact signatures extracted directly without decompression. The compact signature is based on the Dominant Colour Profile (DCP); a sequence of dominant colours extracted and arranged as a sequence of spikes, in analogy to the human retinal representation of a scene. The proposed signature represents a given video shot with ~490 integer values, facilitating for real-time processing to retrieve a maximum set of matching videos. The technique is able to work directly on MPEG compressed videos, without full decompression, as it is utilizing the DC-image as a base for extracting colour features. The DC-image has a highly reduced size, while retaining most of visual aspects, and provides high performance compared to the full I-frame. The experiments and results on various standard datasets show the promising performance, both the accuracy and the efficient computation complexity, of the proposed technique.
Videos, especially the compressed ones, became a major part of our daily life. With the amount of videos growing exponentially, Scientists are being pushed to develop robust tools that could efficiently index and retrieve videos in a way similar to human perception of similarity.
*Based on http://www.youtube.com/yt/press/statistics.html
PROBLEM
§Manual annotation is a hard work and annotations are not always available for utilization.
§We need more smarter tagging process for videos.
§With the increasing of compressed videos, more efficient techniques are required to work directly on compressed files, without need for decompression.
AIM
Our aim is to build a framework that will operate on compressed videos (utilizing the DC-images sequence),
CONCLUSION
•DC-IMAGE is suitable for cheaper computations and could be used as basic building block for real-time processing.
•Local features proved to be effective on DC-image, after our introduced modification.
Congratulations to Dr Amjad Altadmri for completeing his PhD degree. Amjad received his PhD degree in the formal September Graduation Ceremony at the Lincoln Cathedral.
Amjad Graduation Ceremony – September 2013
His PhD titled “Semantic Annotation of Domain-Independent Uncontrolled Videos, Incorporating Visual Similarity and Commonsesne Knowledge Bases”. The work produced a Framework for semantic video annotation. In addition, VisualNet was also produced, which is a semantic Network for Visual-related applications.
The photo shows Dr Amjad Altadmri (Left) with his Director of Studies/Supervisor Dr Amr Ahmed ( right).
Amjad has also participated, with Amr and other members of the DCAPI group, in various workshops especially the V&L EPSRC Network workshops. They presented sessions and showed posters; see related blog posts: