Author Archives: Svebor KARAMAN - Page 2

International Journals

2014

  • [PDF] [DOI] S. Karaman, A. Bagdanov, L. Landucci, G. D’Amico, A. Ferracani, D. Pezzatini, and A. Del Bimbo, “Personalized multimedia content delivery on an interactive table by passive observation of museum visitors,” Multimedia Tools and Applications, pp. 1-25, 2014.
    [Bibtex]
    @article{karaman2014mtap,
    year={2014},
    issn={1380-7501},
    journal={Multimedia Tools and Applications},
    doi={10.1007/s11042-014-2192-y},
    title={Personalized multimedia content delivery on an interactive table by passive observation of museum visitors},
    url={http://dx.doi.org/10.1007/s11042-014-2192-y},
    publisher={Springer US},
    keywords={Computer vision; Video surveillance; Cultural heritage; Multimedia museum; Personalization; Natural interaction; Passive profiling},
    author={Karaman, Svebor and Bagdanov, AndrewD. and Landucci, Lea and D’Amico, Gianpaolo and Ferracani, Andrea and Pezzatini, Daniele and Del Bimbo, Alberto},
    pages={1-25},
    language={English}
    }
  • [PDF] [DOI] S. Karaman, G. Lisanti, A. D. Bagdanov, and A. D. Bimbo, “Leveraging local neighborhood topology for large scale person re-identification,” Pattern Recognition, vol. 47, iss. 12, pp. 3767-3778, 2014.
    [Bibtex]
    @article{karaman2014leveraging,
    title = "Leveraging local neighborhood topology for large scale person re-identification ",
    journal = "Pattern Recognition ",
    volume = "47",
    number = "12",
    pages = "3767 - 3778",
    year = "2014",
    note = "",
    issn = "0031-3203",
    doi = "10.1016/j.patcog.2014.06.003",
    url = "http://www.sciencedirect.com/science/article/pii/S0031320314002258",
    author = "Svebor Karaman and Giuseppe Lisanti and Andrew D. Bagdanov and Alberto Del Bimbo",
    keywords = "Re-Identification",
    keywords = "Conditional Random Field",
    keywords = "Semi-supervised",
    keywords = "\{ETHZ\}",
    keywords = "\{CAVIAR\}",
    keywords = "3DPeS",
    keywords = "\{CMV100\} "
    }

2012

  • [PDF] [DOI] S. Karaman, J. Benois-Pineau, V. Dovgalecs, R. Mégret, J. Pinquier, R. André-Obrecht, Y. Gaëstel, and J. Dartigues, “Hierarchical Hidden Markov Model in detecting activities of daily living in wearable videos for studies of dementia,” Multimedia Tools and Applications (MTAP), vol. 69, iss. 3, p. 1–29, 2012.
    [Bibtex]
    @article{karaman2012hierarchical,
    title={Hierarchical Hidden Markov Model in detecting activities of daily living in wearable videos for studies of dementia},
    author={Karaman, Svebor and Benois-Pineau, Jenny and Dovgalecs, Vladislavs and M{\'e}gret, R{\'e}mi and Pinquier, Julien and Andr{\'e}-Obrecht, R{\'e}gine and Ga{\"e}stel, Yann and Dartigues, Jean-Fran{\c{c}}ois},
    journal={Multimedia Tools and Applications (MTAP)},
    pages={1--29},
    year={2012},
    volume={69},
    number={3},
    doi={10.1007/s11042-012-1117-x},
    publisher={Springer}
    }

International Conferences and Workshops

2015

  • [PDF] A. Ciolini, L. Seidenari, S. Karaman, and A. Del Bimbo, “Efficient Hough Forest Object Detection for Low-power Devices,” in IEEE First International Workshop on Wearable and Ego-vision Systems for Augmented Experience (WEsAX), 2015.
    [Bibtex]
    @inproceedings{ciolini2015,
    author = {Ciolini, Andrea and Seidenari, Lorenzo and Karaman, Svebor and Del Bimbo, Alberto},
    title = {Efficient Hough Forest Object Detection for Low-power Devices},
    booktitle = {IEEE First International Workshop on Wearable and Ego-vision Systems for Augmented Experience (WEsAX)},
    year = {2015}
    }
  • [PDF] F. Bartoli, L. Seidenari, G. Lisanti, S. Karaman, and A. Del Bimbo, “WATSS: a Web Annotation Tool for Surveillance Scenarios,” in ACM Multimedia 2015 Open Source Software Competition, 2015.
    [Bibtex]
    @inproceedings{bartoli2015watss,
    title = {WATSS: a Web Annotation Tool for Surveillance Scenarios},
    author = {Bartoli, Federico and Seidenari, Lorenzo and Lisanti, Giuseppe and Karaman, Svebor and Del Bimbo, Alberto},
    booktitle = {ACM Multimedia 2015 Open Source Software Competition},
    year = {2015}
    }
  • [PDF] F. Bartoli, G. Lisanti, L. Seidenari, S. Karaman, and A. Del Bimbo, “MuseumVisitors: A Dataset for Pedestrian and Group Detection, Gaze Estimation and Behavior Understanding,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2015, p. 19–27.
    [Bibtex]
    @inproceedings{bartoli2015museumvisitors,
    title={MuseumVisitors: A Dataset for Pedestrian and Group Detection, Gaze Estimation and Behavior Understanding},
    author={Bartoli, Federico and Lisanti, Giuseppe and Seidenari, Lorenzo and Karaman, Svebor and Del Bimbo, Alberto},
    booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops},
    pages={19--27},
    year={2015}
    }

2014

  • [PDF] S. Karaman, L. Seidenari, S. Ma, A. Del Bimbo, and S. Sclaroff, “Adaptive Structured Pooling for Action Recognition,” in Proc. of British Machine Vision Conference (BMVC), Nottingham, UK, 2014.
    [Bibtex]
    @InProceedings{karamanbmvc2014,
    author = "Karaman, Svebor and Seidenari, Lorenzo and Ma, Shugao and Del Bimbo, Alberto and Sclaroff, Stan",
    title = "Adaptive Structured Pooling for Action Recognition",
    booktitle = "Proc. of British Machine Vision Conference (BMVC)",
    address = "Nottingham, UK",
    year = "2014",
    note = "Poster",
    }
  • [PDF] F. Bartoli, G. Lisanti, S. Karaman, A. D. Bagdanov, and A. Del Bimbo, “Unsupervised scene adaptation for faster multi-scale pedestrian detection,” in 22nd International Conference on Pattern Recognition (ICPR), Stockholm, Sweden, 2014.
    [Bibtex]
    @InProceedings{bartoliicpr2014,
    author = {Bartoli, Federico and Lisanti, Giuseppe and Karaman, Svebor and Bagdanov, Andrew D. and Del Bimbo, Alberto},
    title = {Unsupervised scene adaptation for faster multi-scale pedestrian detection},
    note = {Oral presentation},
    booktitle = {22nd International Conference on Pattern Recognition (ICPR)},
    address = {Stockholm, Sweden},
    year = {2014}
    }

2013

  • [PDF] S. Karaman, L. Seidenari, A. D. Bagdanov, and A. Del Bimbo, “L1-regularized Logistic Regression Stacking and CRF Smoothing for Action Recognition,” in THUMOS: ICCV Workshop on Action Recognition with a Large Number of Classes, 2013.
    [Bibtex]
    @InProceedings{karamanthumos2013,
    author = "Karaman, Svebor and Seidenari, Lorenzo and Bagdanov, Andrew D. and Del Bimbo, Alberto",
    title = "L1-regularized Logistic Regression Stacking and CRF Smoothing for Action Recognition",
    booktitle = "THUMOS: ICCV Workshop on Action Recognition with a Large Number of Classes",
    year = "2013",
    note = {Oral presentation. Ranked #2 of the Action Recognition Challenge}
    }
  • [PDF] [DOI] S. Karaman, A. D. Bagdanov, G. D’Amico, L. Landucci, A. Ferracani, D. Pezzatini, and A. Del Bimbo, “Passive Profiling and Natural Interaction Metaphors for Personalized Multimedia Museum Experiences,” in MM4CH’13 – New Trends in Image Analysis and Processing – ICIAP 2013, Naples, Italy: Springer, 2013, p. 247–256.
    [Bibtex]
    @incollection{karaman2013passive,
    title = {Passive Profiling and Natural Interaction Metaphors for Personalized Multimedia Museum Experiences},
    author = {Karaman, Svebor and Bagdanov, Andrew D and D’Amico, Gianpaolo and Landucci, Lea and Ferracani, Andrea and Pezzatini, Daniele and Del Bimbo, Alberto},
    booktitle = {MM4CH'13 - New Trends in Image Analysis and Processing -- ICIAP 2013},
    doi = {10.1007/978-3-642-41190-8_27},
    pages = {247--256},
    address = {Naples, Italy},
    year = {2013},
    note={Oral Presentation},
    publisher = {Springer}
    }
  • [PDF] [DOI] A. D. Bagdanov, A. Del Bimbo, D. Di Fina, S. Karaman, G. Lisanti, and I. Masi, “Multi-Target Data Association using Sparse Reconstruction,” in Proc. of International Conference on Image Analysis and Processing (ICIAP), Naples, Italy, 2013, pp. 239-248.
    [Bibtex]
    @inproceedings{DBLMKD13,
    author = {Bagdanov, Andrew D. and Del Bimbo, Alberto and Di Fina, Dario and Karaman, Svebor and Lisanti, Giuseppe and Masi, Iacopo},
    title = {Multi-Target Data Association using Sparse Reconstruction},
    booktitle = {Proc. of International Conference on Image Analysis and Processing (ICIAP)},
    year = {2013},
    address = {Naples, Italy},
    pages = {239-248},
    note={Poster},
    doi = {10.1007/978-3-642-41184-7_25},
    publisher = {Springer Berlin Heidelberg},
    keywords = {Data association; multi-target tracking; sparse methods; video surveillance},
    url = {http://www.micc.unifi.it/publications/2013/DBLMKD13}
    }

2012

  • [PDF] [DOI] S. Karaman and A. D. Bagdanov, “Identity Inference: Generalizing Person Re-identification Scenarios,” in Computer Vision – ECCV 2012. Workshops and Demonstrations, A. Fusiello, V. Murino, and R. Cucchiara, Eds., Firenze, Italy: Springer Berlin Heidelberg, 2012, vol. 7583, pp. 443-452.
    [Bibtex]
    @incollection{karamanIdInf2012,
    isbn={978-3-642-33862-5},
    booktitle={Computer Vision – ECCV 2012. Workshops and Demonstrations},
    volume={7583},
    series={Lecture Notes in Computer Science},
    editor={Fusiello, Andrea and Murino, Vittorio and Cucchiara, Rita},
    doi={10.1007/978-3-642-33863-2_44},
    title={Identity Inference: Generalizing Person Re-identification Scenarios},
    url={http://dx.doi.org/10.1007/978-3-642-33863-2_44},
    publisher={Springer Berlin Heidelberg},
    author={Karaman, Svebor and Bagdanov, Andrew D.},
    pages={443-452},
    address = {Firenze, Italy},
    note={Oral Presentation. Best Paper Award},
    year={2012}
    }
  • [PDF] J. Pinquier, S. Karaman, L. Letoupin, P. Guyot, R. Megret, J. Benois-Pineau, Y. Gaestel, and J. -F. Dartigues, “Strategies for multiple feature fusion with Hierarchical HMM: Application to activity recognition from wearable audiovisual sensors,” in 21st International Conference on Pattern Recognition (ICPR), Tsukuba, Japan, 2012, pp. 3192-3195.
    [Bibtex]
    @INPROCEEDINGS{Pinquier2012,
    author={Pinquier, J. and Karaman, S. and Letoupin, L. and Guyot, P. and Megret, R. and Benois-Pineau, J. and Gaestel, Y. and Dartigues, J.-F.},
    booktitle={21st International Conference on Pattern Recognition (ICPR)},
    title={Strategies for multiple feature fusion with Hierarchical HMM: Application to activity recognition from wearable audiovisual sensors},
    year={2012},
    month={Nov},
    pages={3192-3195},
    abstract={In this paper, we further develop the research on recognition of activities, in videos recorded with wearable cameras, with Hierarchical Hidden Markov Model classifiers. The visual scenes being of a strong complexity in terms of motion and visual content, good performances have been obtained using multiple visual and audio cues. The adequate fusion of features from physically different description spaces remains an open issue not only for this particular task, but in multiple problems of pattern recognition. A study of optimal fusion strategies in the HMM framework is proposed. We design and exploit early, intermediate and late fusions with emitting states in the H-HMM. The results obtained on a corpus recorded by healthy volunteers and patients in a longitudinal dementia study allow choosing optimal fusion strategies as a function of target activity.},
    keywords={gesture recognition;hidden Markov models;image fusion;video signal processing;H-HMM;activity recognition;description spaces;early fusions;healthy volunteers;hierarchical HMM classifier;hierarchical hidden Markov model classifiers;intermediate fusions;late fusions;longitudinal dementia study;motion content;multiple feature fusion;optimal fusion strategies;pattern recognition;strong complexity;target activity;visual content;visual scenes;wearable audiovisual sensors;wearable cameras;Cameras;Hidden Markov models;Multimedia communication;Pattern recognition;Streaming media;Videos;Visualization},
    url = {http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=6460843},
    note={Poster},
    address = {Tsukuba, Japan},
    ISSN={1051-4651}
    }
  • [PDF] [DOI] S. Karaman, J. Benois-Pineau, R. Mégret, and A. Bugeau, “Multi-layer Local Graph Words for Object Recognition,” in Advances in Multimedia Modeling, K. Schoeffmann, B. Merialdo, A. Hauptmann, C. Ngo, Y. Andreopoulos, and C. Breiteneder, Eds., Klagenfurt, Austria: Springer Berlin Heidelberg, 2012, vol. 7131, pp. 29-39.
    [Bibtex]
    @incollection{karamanMMM2012,
    isbn={978-3-642-27354-4},
    booktitle={Advances in Multimedia Modeling},
    volume={7131},
    series={Lecture Notes in Computer Science},
    editor={Schoeffmann, Klaus and Merialdo, Bernard and Hauptmann, AlexanderG. and Ngo, Chong-Wah and Andreopoulos, Yiannis and Breiteneder, Christian},
    doi={10.1007/978-3-642-27355-1_6},
    title={Multi-layer Local Graph Words for Object Recognition},
    url={http://dx.doi.org/10.1007/978-3-642-27355-1_6},
    publisher={Springer Berlin Heidelberg},
    keywords={Feature representation; Structural features; Bag-of-Visual-Words; Graph Words; Delaunay triangulation; Context Dependent Kernel},
    author={Karaman, Svebor and Benois-Pineau, Jenny and Mégret, Rémi and Bugeau, Aurélie},
    note={Oral Presentation},
    address = {Klagenfurt, Austria},
    pages={29-39},
    year={2012}
    }

2011

  • [PDF] [DOI] S. Karaman, J. Benois-Pineau, R. Mégret, J. Pinquier, Y. Gaestel, and J. -F. Dartigues, “Activities of daily living indexing by hierarchical HMM for dementia diagnostics,” in 9th International Workshop on Content-Based Multimedia Indexing (CBMI), Madrid, Spain, 2011, pp. 79-84.
    [Bibtex]
    @INPROCEEDINGS{karamanCBMI2011,
    author={Karaman, S. and Benois-Pineau, J. and Mégret, R. and Pinquier, J. and Gaestel, Y. and Dartigues, J.-F.},
    booktitle={9th International Workshop on Content-Based Multimedia Indexing (CBMI)},
    title={Activities of daily living indexing by hierarchical HMM for dementia diagnostics},
    year={2011},
    month={June},
    address = {Madrid, Spain},
    pages={79-84},
    abstract={This paper presents a method for indexing human activities in videos captured from a wearable camera being worn by patients, for studies of progression of the dementia diseases. Our method aims to produce indexes to facilitate the navigation throughout the individual video recordings, which could help doctors search for early signs of the disease in the activities of daily living. The recorded videos have strong motion and sharp lighting changes, inducing noise for the analysis. The proposed approach is based on a two steps analysis. First, we propose a new approach to segment this type of video, based on apparent motion. Each segment is characterized by two original motion descriptors, as well as color, and audio descriptors. Second, a Hidden-Markov Model formulation is used to merge the multimodal audio and video features, and classify the test segments. Experiments show the good properties of the approach on real data.},
    keywords={hidden Markov models;image colour analysis;image segmentation;indexing;medical diagnostic computing;medical disorders;video recording;audio descriptors;color descriptors;daily living indexing;dementia diagnostics;dementia diseases;hidden-Markov model formulation;hierarchical HMM;human activities indexing;multimodal audio features;original motion descriptors;recorded videos;test segments;two steps analysis;video features;video recordings;wearable camera;Accuracy;Cameras;Dynamics;Hidden Markov models;Histograms;Motion segmentation;Videos},
    doi={10.1109/CBMI.2011.5972524},
    note={Oral Presentation},
    ISSN={1949-3983}
    }
  • [PDF] Y. Gaëstel, S. Karaman, R. Megret, O. Cherifa, T. Francoise, B. Jenny, and J. Dartigues, “Autonomy at home and early diagnosis in Alzheimer’s Disease: Utility of video indexing applied to clinical issues, the IMMED project,” in Alzheimer’s Association International Conference on Alzheimer’s Disease (AAICAD), Paris, France, 2011, p. S245.
    [Bibtex]
    @inproceedings{gaestel2011,
    hal_id = {hal-00978228},
    url = {http://hal.archives-ouvertes.fr/hal-00978228},
    title = {Autonomy at home and early diagnosis in Alzheimer's Disease: Utility of video indexing applied to clinical issues, the IMMED project},
    author = {Ga{\"e}stel, Yann and Karaman, Svebor and Megret, R{\'e}mi and Cherifa, Onifade-Fagbe and Francoise, Trophy and Jenny, Benois-Pineau and Dartigues, Jean-Fran{\c c}ois},
    abstract = {With ageing of the population in the world, patients with Alzheimer's disease (AD) consequently increase. People suffering from this pathology show early modifications in their "activities of daily living". Those abilities modifications are part of the dementia diagnosis, but are often not reported by the patients or their families. Being able to capture these early signs of autonomy loss could be a way to diagnose earlier dementia and to prevent insecurity at home. We first developed a wearable camera (shoulder mounted) to capture people's activity at home in a non-invasive manner. We then developed a video-indexing methodology to help physicians explore their patients' home-recorded video. This video indexing system requires video and audio analyses to automatically identify and index activities of interest where insecurity or risks could be highlightened. Patients are recruited among the Bagatelle (Talence, France) Memory clinic department patients and are suffering from mild cognitive impairments or very mild AD. We met ten patients at home and we recorded one hour of daily activities for each. The data (video and questionnaires: Activities of Daily Living/Instrumental Activities of Daily Living) are now collected on an extended sample of people suffering from mild cognitive impairments and from very mild AD. We aimed at evaluating behavioral modifications and ability loss detection by comparing the subjects' self reported questionnaires and the video analyses. This project is a successful collaboration between various fields of research. Here, technology is developed to be helpful in everyday challenges that people suffering from dementia of the Alzheimer type are faced with. The automation of the video indexing could be a great step forward in video analysis if it could reduce the time needed to embrace the patient's lifestream, helping in early diagnosis of dementia and becoming a very useful tool to keep individuals safe at home. In fact, many goals could be reached with such video analyses: an early diagnosis of dementia of the Alzheimer type, avoiding danger in home living and evaluating the progression of the disease or the effects of the various therapies (drug-therapy and others).},
    language = {Anglais},
    affiliation = {Institut de Sant{\'e} Publique, d'Epid{\'e}miologie et de D{\'e}veloppement - ISPED , Laboratoire Bordelais de Recherche en Informatique - LaBRI , Laboratoire de l'int{\'e}gration, du mat{\'e}riau au syst{\`e}me - IMS , MSPB Bagatelle - MSPB , Epid{\'e}miologie et Biostatistique},
    booktitle = {{Alzheimer's Association International Conference on Alzheimer's Disease (AAICAD)}},
    pages = {S245},
    address = {Paris, France},
    editor = {Alzheimer's \& Dementia: The Journal of the Alzheimer's Association },
    audience = {internationale },
    note = {Poster presentation. Abstract published in Journal of Alzheimer's and Dementia, volume 7 (4), pp. S245, July 2011},
    collaboration = {IMMED },
    year = {2011},
    month = {Jul}
    }

2010

  • [PDF] [DOI] S. Karaman, J. Benois-Pineau, R. Mégret, V. Dovgalecs, J. -F. Dartigues, and Y. Gaëstel, “Human Daily Activities Indexing in Videos from Wearable Cameras for Monitoring of Patients with Dementia Diseases,” in 20th International Conference on Pattern Recognition (ICPR), Istanbul, Turkey, 2010, pp. 4113-4116.
    [Bibtex]
    @INPROCEEDINGS{karamanICPR2010,
    author={Karaman, S. and Benois-Pineau, J. and Mégret, R. and Dovgalecs, V. and Dartigues, J.-F. and Gaëstel, Y.},
    booktitle={20th International Conference on Pattern Recognition (ICPR)},
    title={Human Daily Activities Indexing in Videos from Wearable Cameras for Monitoring of Patients with Dementia Diseases},
    year={2010},
    month={Aug},
    pages={4113-4116},
    abstract={Our research focuses on analysing human activities according to a known behaviorist scenario, in case of noisy and high dimensional collected data. The data come from the monitoring of patients with dementia diseases by wearable cameras. We define a structural model of video recordings based on a Hidden Markov Model. New spatio-temporal features, color features and localization features are proposed as observations. First results in recognition of activities are promising.},
    keywords={feature extraction;hidden Markov models;image colour analysis;image motion analysis;video cameras;video recording;video signal processing;activity recognition;behaviorist scenario;color features;dementia disease patients;hidden Markov model;human activity indexing;localization features;patient monitoring;spatiotemporal features;video recordings;wearable cameras;Biomedical monitoring;Cameras;Hidden Markov models;Histograms;Image color analysis;Motion segmentation;Videos;Bag of Features;HMM;Localization;Monitoring;Video Indexing},
    doi={10.1109/ICPR.2010.999},
    note={Oral Presentation},
    ISSN={1051-4651},
    address={Istanbul, Turkey}
    }
  • [PDF] [DOI] R. Mégret, V. Dovgalecs, H. Wannous, S. Karaman, J. Benois-Pineau, E. El Khoury, J. Pinquier, P. Joly, R. André-Obrecht, Y. Gaëstel, and J. Dartigues, “The IMMED Project: Wearable Video Monitoring of People with Age Dementia,” in Proceedings of the International Conference on Multimedia (ACMMM), Firenze, Italy, 2010, p. 1299–1302.
    [Bibtex]
    @inproceedings{Megret2010,
    author = {M{\'e}gret, R{\'e}mi and Dovgalecs, Vladislavs and Wannous, Hazem and Karaman, Svebor and Benois-Pineau, Jenny and El Khoury, Elie and Pinquier, Julien and Joly, Philippe and Andr{\'e}-Obrecht, R{\'e}gine and Ga\"{e}stel, Yann and Dartigues, Jean-Fran\c{c}ois},
    title = {The IMMED Project: Wearable Video Monitoring of People with Age Dementia},
    booktitle = {Proceedings of the International Conference on Multimedia (ACMMM)},
    series = {MM '10},
    year = {2010},
    isbn = {978-1-60558-933-6},
    address = {Firenze, Italy},
    pages = {1299--1302},
    numpages = {4},
    url = {http://doi.acm.org/10.1145/1873951.1874206},
    doi = {10.1145/1873951.1874206},
    acmid = {1874206},
    note = {Video program},
    publisher = {ACM},
    keywords = {audio and video indexing, patient monitoring, wearable camera}
    }

PhD Thesis

The research of my PhD thesis [1] was fulfilled in the context of wearable video monitoring of patients with aged dementia. The idea was to provide a new tool to medical practitioners for the early diagnosis of elderly dementia such as the Alzheimer disease [2]. More precisely, Instrumental Activities of Daily Living (IADL) had to be indexed in videos recorded with a wearable recording device.

Such videos present specific characteristics i.e. strong motion or strong lighting changes. Furthermore, the tackled recognition task is of a very strong semantics. In this difficult context, the first step of analysis was to define an equivalent to the notion of “shots” in edited videos. We therefore developed a method for partitioning continuous video streams into viewpoints according to the observed motion in the image plane [3]. For the recognition of IADLs we developed a solution based on the formalism of Hidden Markov Models (HMM) [4]. A hierarchical HMM with two levels modeling semantic activities or intermediate states has been introduced [5]. A complex set of features (dynamic, static, low-level, mid-level) was proposed and the most effective description spaces were identified experimentally [6].

In the mid-level features for activities recognition we focused on the semantic objects the person manipulates in the camera view. We proposed a new concept for object/image description using local features (SURF) and the underlying semi-local connected graphs. We introduced a nested approach for graphs construction when the same scene can be described by levels of graphs with increasing number of nodes. We build these graphs with Delaunay triangulation on SURF points thus preserving good properties of local features i.e. the invariance with regard to affine transformation of image plane: rotation, translation and zoom. We use the graph features in the Bag-of-Visual-Words framework, hence introducing the Graph Words [7]. The problem of distance or dissimilarity definition between graphs for clustering or recognition is obviously arisen. We propose a dissimilarity measure based on the Context Dependent Kernel of H. Sahbi and show its relation with the classical entry-wise norm when comparing trivial graphs (SURF points).

Related publications

[1] [pdf] S. Karaman, “Indexing of Activities in Wearable Videos : Application to Epidemiological Studies of Aged Dementia,” PhD Thesis, 2011.
[Bibtex]
@phdthesis{karaman2011phd,
title={Indexing of Activities in Wearable Videos : Application to Epidemiological Studies of Aged Dementia},
author={Karaman, Svebor},
year={2011},
school={Universit{\'e} Sciences et Technologies-Bordeaux I}
}
[2] [pdf] Y. Gaëstel, S. Karaman, R. Megret, O. Cherifa, T. Francoise, B. Jenny, and J. Dartigues, “Autonomy at home and early diagnosis in Alzheimer’s Disease: Utility of video indexing applied to clinical issues, the IMMED project,” in Alzheimer’s Association International Conference on Alzheimer’s Disease (AAICAD), Paris, France, 2011, p. S245.
[Bibtex]
@inproceedings{gaestel2011,
hal_id = {hal-00978228},
url = {http://hal.archives-ouvertes.fr/hal-00978228},
title = {Autonomy at home and early diagnosis in Alzheimer's Disease: Utility of video indexing applied to clinical issues, the IMMED project},
author = {Ga{\"e}stel, Yann and Karaman, Svebor and Megret, R{\'e}mi and Cherifa, Onifade-Fagbe and Francoise, Trophy and Jenny, Benois-Pineau and Dartigues, Jean-Fran{\c c}ois},
abstract = {With ageing of the population in the world, patients with Alzheimer's disease (AD) consequently increase. People suffering from this pathology show early modifications in their "activities of daily living". Those abilities modifications are part of the dementia diagnosis, but are often not reported by the patients or their families. Being able to capture these early signs of autonomy loss could be a way to diagnose earlier dementia and to prevent insecurity at home. We first developed a wearable camera (shoulder mounted) to capture people's activity at home in a non-invasive manner. We then developed a video-indexing methodology to help physicians explore their patients' home-recorded video. This video indexing system requires video and audio analyses to automatically identify and index activities of interest where insecurity or risks could be highlightened. Patients are recruited among the Bagatelle (Talence, France) Memory clinic department patients and are suffering from mild cognitive impairments or very mild AD. We met ten patients at home and we recorded one hour of daily activities for each. The data (video and questionnaires: Activities of Daily Living/Instrumental Activities of Daily Living) are now collected on an extended sample of people suffering from mild cognitive impairments and from very mild AD. We aimed at evaluating behavioral modifications and ability loss detection by comparing the subjects' self reported questionnaires and the video analyses. This project is a successful collaboration between various fields of research. Here, technology is developed to be helpful in everyday challenges that people suffering from dementia of the Alzheimer type are faced with. The automation of the video indexing could be a great step forward in video analysis if it could reduce the time needed to embrace the patient's lifestream, helping in early diagnosis of dementia and becoming a very useful tool to keep individuals safe at home. In fact, many goals could be reached with such video analyses: an early diagnosis of dementia of the Alzheimer type, avoiding danger in home living and evaluating the progression of the disease or the effects of the various therapies (drug-therapy and others).},
language = {Anglais},
affiliation = {Institut de Sant{\'e} Publique, d'Epid{\'e}miologie et de D{\'e}veloppement - ISPED , Laboratoire Bordelais de Recherche en Informatique - LaBRI , Laboratoire de l'int{\'e}gration, du mat{\'e}riau au syst{\`e}me - IMS , MSPB Bagatelle - MSPB , Epid{\'e}miologie et Biostatistique},
booktitle = {{Alzheimer's Association International Conference on Alzheimer's Disease (AAICAD)}},
pages = {S245},
address = {Paris, France},
editor = {Alzheimer's \& Dementia: The Journal of the Alzheimer's Association },
audience = {internationale },
note = {Poster presentation. Abstract published in Journal of Alzheimer's and Dementia, volume 7 (4), pp. S245, July 2011},
collaboration = {IMMED },
year = {2011},
month = {Jul}
}
[3] [pdf] [doi] S. Karaman, J. Benois-Pineau, R. Mégret, J. Pinquier, Y. Gaestel, and J. -F. Dartigues, “Activities of daily living indexing by hierarchical HMM for dementia diagnostics,” in 9th International Workshop on Content-Based Multimedia Indexing (CBMI), Madrid, Spain, 2011, pp. 79-84.
[Bibtex]
@INPROCEEDINGS{karamanCBMI2011,
author={Karaman, S. and Benois-Pineau, J. and Mégret, R. and Pinquier, J. and Gaestel, Y. and Dartigues, J.-F.},
booktitle={9th International Workshop on Content-Based Multimedia Indexing (CBMI)},
title={Activities of daily living indexing by hierarchical HMM for dementia diagnostics},
year={2011},
month={June},
address = {Madrid, Spain},
pages={79-84},
abstract={This paper presents a method for indexing human activities in videos captured from a wearable camera being worn by patients, for studies of progression of the dementia diseases. Our method aims to produce indexes to facilitate the navigation throughout the individual video recordings, which could help doctors search for early signs of the disease in the activities of daily living. The recorded videos have strong motion and sharp lighting changes, inducing noise for the analysis. The proposed approach is based on a two steps analysis. First, we propose a new approach to segment this type of video, based on apparent motion. Each segment is characterized by two original motion descriptors, as well as color, and audio descriptors. Second, a Hidden-Markov Model formulation is used to merge the multimodal audio and video features, and classify the test segments. Experiments show the good properties of the approach on real data.},
keywords={hidden Markov models;image colour analysis;image segmentation;indexing;medical diagnostic computing;medical disorders;video recording;audio descriptors;color descriptors;daily living indexing;dementia diagnostics;dementia diseases;hidden-Markov model formulation;hierarchical HMM;human activities indexing;multimodal audio features;original motion descriptors;recorded videos;test segments;two steps analysis;video features;video recordings;wearable camera;Accuracy;Cameras;Dynamics;Hidden Markov models;Histograms;Motion segmentation;Videos},
doi={10.1109/CBMI.2011.5972524},
note={Oral Presentation},
ISSN={1949-3983}
}
[4] [pdf] [doi] S. Karaman, J. Benois-Pineau, R. Mégret, V. Dovgalecs, J. -F. Dartigues, and Y. Gaëstel, “Human Daily Activities Indexing in Videos from Wearable Cameras for Monitoring of Patients with Dementia Diseases,” in 20th International Conference on Pattern Recognition (ICPR), Istanbul, Turkey, 2010, pp. 4113-4116.
[Bibtex]
@INPROCEEDINGS{karamanICPR2010,
author={Karaman, S. and Benois-Pineau, J. and Mégret, R. and Dovgalecs, V. and Dartigues, J.-F. and Gaëstel, Y.},
booktitle={20th International Conference on Pattern Recognition (ICPR)},
title={Human Daily Activities Indexing in Videos from Wearable Cameras for Monitoring of Patients with Dementia Diseases},
year={2010},
month={Aug},
pages={4113-4116},
abstract={Our research focuses on analysing human activities according to a known behaviorist scenario, in case of noisy and high dimensional collected data. The data come from the monitoring of patients with dementia diseases by wearable cameras. We define a structural model of video recordings based on a Hidden Markov Model. New spatio-temporal features, color features and localization features are proposed as observations. First results in recognition of activities are promising.},
keywords={feature extraction;hidden Markov models;image colour analysis;image motion analysis;video cameras;video recording;video signal processing;activity recognition;behaviorist scenario;color features;dementia disease patients;hidden Markov model;human activity indexing;localization features;patient monitoring;spatiotemporal features;video recordings;wearable cameras;Biomedical monitoring;Cameras;Hidden Markov models;Histograms;Image color analysis;Motion segmentation;Videos;Bag of Features;HMM;Localization;Monitoring;Video Indexing},
doi={10.1109/ICPR.2010.999},
note={Oral Presentation},
ISSN={1051-4651},
address={Istanbul, Turkey}
}
[5] [pdf] [doi] S. Karaman, J. Benois-Pineau, V. Dovgalecs, R. Mégret, J. Pinquier, R. André-Obrecht, Y. Gaëstel, and J. Dartigues, “Hierarchical Hidden Markov Model in detecting activities of daily living in wearable videos for studies of dementia,” Multimedia Tools and Applications (MTAP), vol. 69, iss. 3, p. 1–29, 2012.
[Bibtex]
@article{karaman2012hierarchical,
title={Hierarchical Hidden Markov Model in detecting activities of daily living in wearable videos for studies of dementia},
author={Karaman, Svebor and Benois-Pineau, Jenny and Dovgalecs, Vladislavs and M{\'e}gret, R{\'e}mi and Pinquier, Julien and Andr{\'e}-Obrecht, R{\'e}gine and Ga{\"e}stel, Yann and Dartigues, Jean-Fran{\c{c}}ois},
journal={Multimedia Tools and Applications (MTAP)},
pages={1--29},
year={2012},
volume={69},
number={3},
doi={10.1007/s11042-012-1117-x},
publisher={Springer}
}
[6] [pdf] J. Pinquier, S. Karaman, L. Letoupin, P. Guyot, R. Megret, J. Benois-Pineau, Y. Gaestel, and J. -F. Dartigues, “Strategies for multiple feature fusion with Hierarchical HMM: Application to activity recognition from wearable audiovisual sensors,” in 21st International Conference on Pattern Recognition (ICPR), Tsukuba, Japan, 2012, pp. 3192-3195.
[Bibtex]
@INPROCEEDINGS{Pinquier2012,
author={Pinquier, J. and Karaman, S. and Letoupin, L. and Guyot, P. and Megret, R. and Benois-Pineau, J. and Gaestel, Y. and Dartigues, J.-F.},
booktitle={21st International Conference on Pattern Recognition (ICPR)},
title={Strategies for multiple feature fusion with Hierarchical HMM: Application to activity recognition from wearable audiovisual sensors},
year={2012},
month={Nov},
pages={3192-3195},
abstract={In this paper, we further develop the research on recognition of activities, in videos recorded with wearable cameras, with Hierarchical Hidden Markov Model classifiers. The visual scenes being of a strong complexity in terms of motion and visual content, good performances have been obtained using multiple visual and audio cues. The adequate fusion of features from physically different description spaces remains an open issue not only for this particular task, but in multiple problems of pattern recognition. A study of optimal fusion strategies in the HMM framework is proposed. We design and exploit early, intermediate and late fusions with emitting states in the H-HMM. The results obtained on a corpus recorded by healthy volunteers and patients in a longitudinal dementia study allow choosing optimal fusion strategies as a function of target activity.},
keywords={gesture recognition;hidden Markov models;image fusion;video signal processing;H-HMM;activity recognition;description spaces;early fusions;healthy volunteers;hierarchical HMM classifier;hierarchical hidden Markov model classifiers;intermediate fusions;late fusions;longitudinal dementia study;motion content;multiple feature fusion;optimal fusion strategies;pattern recognition;strong complexity;target activity;visual content;visual scenes;wearable audiovisual sensors;wearable cameras;Cameras;Hidden Markov models;Multimedia communication;Pattern recognition;Streaming media;Videos;Visualization},
url = {http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=6460843},
note={Poster},
address = {Tsukuba, Japan},
ISSN={1051-4651}
}
[7] [pdf] [doi] S. Karaman, J. Benois-Pineau, R. Mégret, and A. Bugeau, “Multi-layer Local Graph Words for Object Recognition,” in Advances in Multimedia Modeling, K. Schoeffmann, B. Merialdo, A. Hauptmann, C. Ngo, Y. Andreopoulos, and C. Breiteneder, Eds., Klagenfurt, Austria: Springer Berlin Heidelberg, 2012, vol. 7131, pp. 29-39.
[Bibtex]
@incollection{karamanMMM2012,
isbn={978-3-642-27354-4},
booktitle={Advances in Multimedia Modeling},
volume={7131},
series={Lecture Notes in Computer Science},
editor={Schoeffmann, Klaus and Merialdo, Bernard and Hauptmann, AlexanderG. and Ngo, Chong-Wah and Andreopoulos, Yiannis and Breiteneder, Christian},
doi={10.1007/978-3-642-27355-1_6},
title={Multi-layer Local Graph Words for Object Recognition},
url={http://dx.doi.org/10.1007/978-3-642-27355-1_6},
publisher={Springer Berlin Heidelberg},
keywords={Feature representation; Structural features; Bag-of-Visual-Words; Graph Words; Delaunay triangulation; Context Dependent Kernel},
author={Karaman, Svebor and Benois-Pineau, Jenny and Mégret, Rémi and Bugeau, Aurélie},
note={Oral Presentation},
address = {Klagenfurt, Austria},
pages={29-39},
year={2012}
}

National Conferences

  • [PDF] S. Karaman, J. Benois-Pineau, R. Mégret, and A. Bugeau, “Mots visuels issus de graphes locaux multi-niveaux pour la reconnaissance d’objets,” in Actes de la conférence RFIA 2012, Lyon, France, 2012, p. 978-2-9539515-2-3.
    [Bibtex]
    @inproceedings{karamanRFIA2012,
    hal_id = {hal-00656516},
    url = {http://hal.archives-ouvertes.fr/hal-00656516},
    title = {Mots visuels issus de graphes locaux multi-niveaux pour la reconnaissance d'objets},
    author = {Karaman, Svebor and Benois-Pineau, Jenny and M{\'e}gret, R{\'e}mi and Bugeau, Aur{\'e}lie},
    abstract = {{Dans cet article, nous nous int{\'e}ressons au probl{\`e}me ouvert {\`a} ce jour en indexation et recherche d'images {\`a} savoir la reconnaissance des objets. Depuis l'apparition de l'approche par des " Sacs-de-descripteurs " et ensuite des " Sac-de-mots ", la " d{\'e}structuration " de la description des images en utilisant des ensembles non structur{\'e}s de caract{\'e}ristiques a {\'e}t{\'e} contr{\'e}e par l'introduction de diff{\'e}rents groupements de descripteurs locaux ou encore par l'introduction de la topologie. Ainsi la reconnaissance d'objets peut {\^e}tre vue {\`a} ce jour comme le retour, {\`a} un autre niveau et avec d'autres outils, {\`a} la d{\'e}marche structurelle. Les caract{\'e}ristiques structurelles que nous proposons pour la reconnaissance d'objets sont les graphes locaux multi-niveaux embo{\^\i}t{\'e}s {\'e}tablis sur des ensembles de points SURF avec la triangulation de Delaunay. Cette repr{\'e}sentation conserve l'invariance aux transformations g{\'e}om{\'e}triques du plan-image inh{\'e}rente aux descripteurs SIFT/SURF. Une approche de type sac de mots visuels est appliqu{\'e}e sur ces graphes, donnant naissance {\`a} une repr{\'e}sentation de sacs de mots issus de graphes locaux. La construction des graphes locaux op{\`e}re par niveaux successifs, depuis les graphes de Delaunay {\'e}l{\'e}mentaires - les points SURF isol{\'e}s - en augmentant le nombre de n{\oe}uds {\`a} chaque couche. Pour chaque niveau de graphes un dictionnaire visuel distinct est {\'e}tabli. Les exp{\'e}riences entreprises sur les ensembles de donn{\'e}es SIVAL et Caltech-101 indiquent que les graphes multi-niveaux ont des performances compl{\'e}mentaires sur chaque niveau et que leur combinaison am{\'e}liore les performances par rapport {\`a} l'approche par sacs de mots visuels}},
    keywords = {Repr{\'e}sentation par primitives visuelles, Caract{\'e}ristiques structurelles, Sac-de-Mots-Visuels, Graphes de mots visuels, Triangulation de Delaunay, Noyau d{\'e}pendant du contexte},
    language = {French},
    affiliation = {Laboratoire Bordelais de Recherche en Informatique - LaBRI , Laboratoire de l'int{\'e}gration, du mat{\'e}riau au syst{\`e}me - IMS},
    booktitle = {{Actes de la conf{\'e}rence RFIA 2012}},
    pages = {978-2-9539515-2-3},
    address = {Lyon, France},
    note = {Oral presentation},
    note = {Session "Articles" },
    audience = {national },
    year = {2012},
    month = {Jan}
    }

Reviewing activity

I have been a reviewer for the following referred publications:

  • Journals: Multimedia Tools And Applications, Advances in Artificial Intelligence, IEEE Transactions on Circuits and Systems for Video Technology.
  • International Conferences: ACMMM’12, ACMMM’14, BMVC’13, CIARP’13, HPCC’13, ICIAP’13, ICIP’13, ICPR’14, ISPA’13, PSIVT’13, WSDM’13, WSDM’14.
  • International Workshops: ARTEMIS’11, ARTEMIS’12, ARTEMIS’13, MM4CH’13.
  • National Conference: GRETSI’13.