Research Archives - Professor Amr Ahmed

August 30, 2013

Best Student Paper Award 2013 – WCE 2013

Congratulations to Saddam Bekhet (PhD Researcher), who won the “Best Student Paper Award 2013” for his conference paper entitled “Video Matching Using DC-image and Local Features”, presented earlier at the “World Congress on Engineering 2013” in London.

Abstract: This paper presents a suggested framework for video matching based on local features extracted from the DC-image of MPEG compressed videos, without decompression. The relevant arguments and supporting evidence are discussed for developing video similarity techniques that work directly on compressed videos, without decompression, especially utilising small-size images. Two experiments are carried out to support the above. The first compares the DC-image and the I-frame in terms of matching performance and the corresponding computational complexity. The second compares local features and global features in video matching, especially in the compressed domain and with small-size images. The results confirmed that the use of the DC-image, despite its highly reduced size, is promising, as it produces at least similar (if not better) matching precision compared to the full I-frame. Also, using SIFT as a local feature outperforms the precision of most of the standard global features. On the other hand, its computational complexity is relatively higher, but it is still within the real-time margin. There are also various optimisations that can be made to reduce this computational complexity.
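To give a feel for how small a DC-image is, here is a minimal sketch in Python. It is an illustration only, not the method from the paper: it assumes 8x8 blocks and builds the reduced image by averaging each block of an already decoded greyscale frame. This matches the DC coefficient of each block up to a scale factor, whereas a compressed-domain approach would read those coefficients from the MPEG stream without decoding. The function name `dc_image` and the synthetic frame are hypothetical.

```python
def dc_image(frame, block=8):
    """Approximate a DC-image: one value per block x block tile (the tile mean).

    `frame` is a list of rows of greyscale pixel values. Rows and columns
    that do not fill a whole tile are ignored.
    """
    rows = len(frame) // block
    cols = len(frame[0]) // block
    reduced = []
    for r in range(rows):
        reduced_row = []
        for c in range(cols):
            total = 0
            # Sum all pixels of the tile at block-row r, block-column c.
            for y in range(r * block, (r + 1) * block):
                for x in range(c * block, (c + 1) * block):
                    total += frame[y][x]
            reduced_row.append(total / (block * block))
        reduced.append(reduced_row)
    return reduced


# Synthetic 16x16 frame: left half black (0), right half white (255).
frame = [[0] * 8 + [255] * 8 for _ in range(16)]
small = dc_image(frame)
print(small)  # [[0.0, 255.0], [0.0, 255.0]]
```

A 16x16 frame shrinks to 2x2 here. With 8x8 blocks the reduced image has 1/64 of the original pixels, which is why matching on such small images is attractive if precision holds up.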

July 11, 2013

Conference paper presented at WCE’13 – 3rd July 2013 – London

The paper (titled “Video Matching Using DC-image and Local Features”) was presented by Saddam Bekhet (PhD Researcher) at the International Conference of Signal and Image Engineering (ICSIE’13), during the World Congress on Engineering 2013, in London, UK.

Well done Saddam.

June 18, 2013

Conference paper Accepted to the “World Congress on Engineering”

New conference paper accepted for publication in the “World Congress on Engineering 2013”.

The paper title is “Video Matching Using DC-image and Local Features”.

March 16, 2013

Automatic Semantic Video Annotation

Amjad Altadmri, Amr Ahmed*, Andrew Hunter

Poster: see the link below to download the PDF.

(Poster PDF, “Semantic Video Annotation with Knowledge”: http://amrahmed.blogs.lincoln.ac.uk/files/2013/03/Semantic-Video-Annotation-with-Knowledge.pdf)

INTRODUCTION

The volume of video data is growing exponentially. This data needs to be annotated to facilitate search and retrieval, so that we can quickly find a video whenever needed.

Manual annotation, especially for such a volume, is time-consuming and expensive. Hence, automated annotation systems are required.

AIM

Automated Semantic Annotation of wide-domain videos (i.e. no domain restrictions). This is an important step towards bridging the “Semantic Gap” in video understanding.

METHOD

1. Extract a “Video Signature” for each video.

2. Match signatures to find the most similar videos that have annotations.

3. Analyse and process the obtained annotations, in consultation with common-sense knowledge-bases.

4. Produce the suggested annotation.
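The four steps above can be sketched roughly in Python. This is a toy illustration under loud assumptions, not the framework's actual implementation: signatures are taken to be plain feature vectors, similarity is cosine similarity, and the common-sense knowledge-base is stood in for by a small hypothetical `related` dictionary that boosts tags supported by a related tag. The names `cosine` and `suggest_annotations` and all data are made up for the example.

```python
import math
from collections import defaultdict


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def suggest_annotations(query_sig, annotated, related, k=2, top=2):
    # Step 2: keep the k annotated videos whose signatures are most similar.
    ranked = sorted(annotated, key=lambda v: cosine(query_sig, v["sig"]), reverse=True)[:k]

    # Step 3a: pool their tags, weighting each tag by the video's similarity.
    scores = defaultdict(float)
    for video in ranked:
        similarity = cosine(query_sig, video["sig"])
        for tag in video["tags"]:
            scores[tag] += similarity

    # Step 3b: toy stand-in for the knowledge-base; a tag gains support
    # when a tag it is related to was also retrieved.
    boosted = dict(scores)
    for tag in scores:
        for other in related.get(tag, ()):
            if other in scores:
                boosted[tag] += 0.5 * scores[other]

    # Step 4: return the highest-scoring tags (ties broken alphabetically).
    return sorted(boosted, key=lambda t: (-boosted[t], t))[:top]


# Step 1 is assumed done: each video already has a (hypothetical) signature.
annotated = [
    {"sig": [1.0, 0.0, 0.0], "tags": ["horse", "riding"]},
    {"sig": [0.9, 0.1, 0.0], "tags": ["horse", "field"]},
    {"sig": [0.0, 0.0, 1.0], "tags": ["kitchen"]},
]
related = {"horse": ["riding"], "riding": ["horse"]}

print(suggest_annotations([1.0, 0.05, 0.0], annotated, related))  # ['horse', 'riding']
```

In this toy run "riding" and "field" are almost tied after similarity weighting alone. The knowledge-base step moves "riding" ahead because it is related to "horse", which both retrieved videos share.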

EVALUATION

• Two standard and challenging datasets were used: TRECVID BBC Rush and UCF.

• Black-box and white-box testing were carried out.

• Measures include precision and the confusion matrix.
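For readers unfamiliar with the two measures, the following minimal Python sketch shows how a confusion matrix and per-label precision can be computed. The labels and predictions are invented for illustration and are not results from the evaluation.

```python
from collections import Counter


def confusion_matrix(actual, predicted):
    """Count (actual label, predicted label) pairs."""
    return Counter(zip(actual, predicted))


def precision(matrix, label):
    """Fraction of items predicted as `label` that really are `label`."""
    true_positives = matrix[(label, label)]
    predicted_as_label = sum(n for (_, p), n in matrix.items() if p == label)
    return true_positives / predicted_as_label if predicted_as_label else 0.0


# Hypothetical ground truth and system output for five clips.
actual = ["run", "run", "walk", "walk", "jump"]
predicted = ["run", "walk", "walk", "walk", "run"]

matrix = confusion_matrix(actual, predicted)
print(matrix[("run", "walk")])    # 1 clip of "run" was labelled "walk"
print(precision(matrix, "run"))   # 0.5
```

The off-diagonal entries of the matrix show which labels are mistaken for which, something a single precision figure cannot reveal.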

CONCLUSION

• Developed an Automatic Semantic Video Annotation framework.

• Not restricted to videos from a specific domain.

• Utilising common-sense knowledge enhances scene understanding and improves semantic annotation.

Publications

A framework for automatic semantic video annotation
Altadmri, Amjad and Ahmed, Amr (2013) A framework for automatic semantic video annotation. Multimedia Tools and Applications, 64 (2). ISSN 1380-7501.
Semantic levels of domain-independent commonsense knowledgebase for visual indexing and retrieval applications
Altadmri, Amjad and Ahmed, Amr and Mohtasseb Billah, Haytham (2012) Semantic levels of domain-independent commonsense knowledgebase for visual indexing and retrieval applications. Neural Information Processing. Lecture Notes in Computer Science, 7663. pp. 640-647. ISSN 0302-9743.
VisualNet: commonsense knowledgebase for video and image indexing and retrieval application
Alabdullah Altadmri, Amjad and Ahmed, Amr (2009) VisualNet: commonsense knowledgebase for video and image indexing and retrieval application. In: IEEE International Conference on Intelligent Computing and Intelligent Systems, 21-22 November 2009, Shanghai, China.
Automatic semantic video annotation in wide domain videos based on similarity and commonsense knowledgebases
Altadmri, Amjad and Ahmed, Amr (2009) Automatic semantic video annotation in wide domain videos based on similarity and commonsense knowledgebases. In: The IEEE International Conference on Signal and Image Processing Applications (ICSIPA 2009), 18-19th November 2009, Malaysia.
Video databases annotation enhancing using commonsense knowledgebases for indexing and retrieval
Altadmri, Amjad and Ahmed, Amr (2009) Video databases annotation enhancing using commonsense knowledgebases for indexing and retrieval. In: The 13th IASTED International Conference on Artificial Intelligence and Soft Computing, September 7-9, 2009, Palma de Mallorca, Spain.

February 27, 2013

Amjad Altadmri – PhD

Amjad Altadmri has passed his PhD viva, subject to minor amendments, earlier today.

Thesis Title: “Semantic Video Annotation in Domain-Independent Videos Utilising Similarity and Commonsense Knowledgebases”

Thanks to the external examiner, Dr John Wood from the University of Essex; the internal examiner, Dr Bashir Al-Diri; and the viva chair, Dr Kun Guo.

Congratulations and well done.

All colleagues are invited to join Amjad in celebrating his achievement tomorrow (Thursday 28th Feb) at 12:00 noon, in our meeting room MC3108, with some drinks and light refreshments available.

Best wishes.

January 20, 2013

New journal paper accepted by “Multimedia Tools and Applications”

New journal paper accepted for publication in the journal “Multimedia Tools and Applications”.

The paper title is “A Framework for Automatic Semantic Video Annotation utilising Similarity and Commonsense Knowledgebases”

Abstract:

The rapidly increasing quantity of publicly available videos has driven research into developing automatic tools for indexing, rating, searching and retrieval. Textual semantic representations, such as tagging, labelling and annotation, are often important factors in the process of indexing any video, because of their user-friendly way of representing the semantics appropriate for search and retrieval. Ideally, this annotation should be inspired by the human cognitive way of perceiving and of describing videos. The difference between the low-level visual contents and the corresponding human perception is referred to as the ‘semantic gap’. Tackling this gap is even harder in the case of unconstrained videos, mainly due to the lack of any previous information about the analyzed video on the one hand, and the huge amount of generic knowledge required on the other.

This paper introduces a framework for the Automatic Semantic Annotation of unconstrained videos. The proposed framework utilizes two non-domain-specific layers: low-level visual similarity matching, and an annotation analysis that employs commonsense knowledgebases. Commonsense ontology is created by incorporating multiple-structured semantic relationships. Experiments and black-box tests are carried out on standard video databases for action recognition and video information retrieval. White-box tests examine the performance of the individual intermediate layers of the framework, and the evaluation of the results and the statistical analysis show that integrating visual similarity matching with commonsense semantic relationships provides an effective approach to automated video annotation.

Well done and congratulations to Amjad Altadmri.

December 22, 2012

Two Presentations and Posters in the Vision & Language Network workshop

Three members of the Lincoln School of Computer Science, and of the DCAPI group, attended the Vision & Language (V&L) Network workshop, 13-14th Dec. 2012, in Sheffield, UK.

Amr Ahmed, Amjad Altadmri and Deema AbdalHafeth attended the event, where Amjad and Deema delivered two oral presentations and two posters about their research work:

1. VisualNet: Semantic Commonsense Knowledgebase for Visual Applications

2. Investigating text analysis of user-generated contents for health related applications

Abstracts are available at http://www.vlnet.org.uk/VLW12/VLW-2012-Accepted-Abstracts.html

Congratulations to all involved.

Amjad Altadmri and Amr Ahmed around their poster at the Vision & Language Net workshop, 13-14th Dec 2012, Sheffield, UK.

Deema AbdalHafeth and Amr Ahmed at the Vision & Language Net workshop, 13-14th Dec 2012, Sheffield, UK.

The event included tutorial sessions (Vision for language people, and language for vision people). We had an increased presence this year.

We also had a good presence at last year’s workshop (http://amrahmed.blogs.lincoln.ac.uk/2011/09/19/vl-network-workshop-brighton/), with good discussions and useful feedback on the presented work.

Looking forward to a similar, if not even better, experience this year.

Best wishes for the presentations.

December 2, 2012

Two posters & presentations accepted for the V&L Net Workshop, Dec. 2012

We have just had two posters and oral presentations accepted for the coming Vision & Language (V&L) Network workshop, 13-14th Dec. 2012, in Sheffield, UK. This is a good representation from Lincoln (and from the DCAPI group).

1. VisualNet: Semantic Commonsense Knowledgebase for Visual Applications

2. Investigating text analysis of user-generated contents for health related applications

Congratulations to all involved.

We had a good presence at last year’s workshop (http://amrahmed.blogs.lincoln.ac.uk/2011/09/19/vl-network-workshop-brighton/), with good discussions and useful feedback on the presented work.

Looking forward to a similar, if not even better, experience this year.

Best wishes for the presentations.

August 14, 2012

Posters & Presentations in the Gerontology Conference (ISG*ISARC2012, Eindhoven)

Amr Ahmed attended the International Conference of the Society of Gerontology at Technische Universiteit Eindhoven (TU/e), Eindhoven, Netherlands. He, with Dr Chris Liam (Surrey), presented a poster with an oral presentation on the SUS-IT project outcomes. Amr also had another poster on the Intelligent Mobility Scooter Navigation project (funded by the iNET Transport).

August 6, 2012

Presented the DHS paper at CGIM’12, Crete

Amr presented a paper at the IASTED CGIM’12 conference about the DHS surgeon virtual training project. The paper reports on the development of two 3D tracking prototypes for virtual reality training of surgeons (in vitro / off patient), especially for the Dynamic Hip Screw surgical procedure (in particular, the insertion of the guide-wire). The aim is to develop cognitive coordination, in particular the brain/hands/eyes coordination that is crucial for such a procedure, through an affordable system that uses commercial off-the-shelf (COTS) components.

This work is in collaboration with Prof. Maqsood, Consultant Trauma and Orthopaedic Surgeon at the Lincoln Hospital.

Project in the media/press:

More information and images: http://amrahmed.blogs.lincoln.ac.uk/2009/11/04/masterig-dhs/