Program Outline
Conference Day 1, January 8th, 2019 | | | |
---|---|---|---|
08:30-09:30 | Registration, 1st floor foyer | | |
09:30-11:00 | Tutorial 1, Poseidon A+C (1st floor) | MANPU Workshop, Poseidon B (1st floor) | |
11:00-11:15 | Coffee break, 1st floor foyer | | |
11:15-12:45 | Tutorial 1 (cont.), Poseidon A+C (1st floor) | MANPU Workshop, Poseidon B (1st floor) | |
12:45-13:45 | Lunch break, 1st floor foyer | | |
13:45-15:15 | Tutorial 2, Poseidon A+C (1st floor) | MANPU Workshop, Poseidon B (1st floor) | VBS rehearsal and closed session, Dias (7th floor) |
15:15-15:30 | Coffee break, 1st floor foyer | | |
15:30-17:00 | Tutorial 2, Poseidon A+C (1st floor) | MANPU Workshop, Poseidon B (1st floor) | VBS rehearsal and closed session, Dias (7th floor) |
Conference Day 2, January 9th, 2019 | | |
---|---|---|
08:00-09:00 | Registration, 7th floor foyer | |
09:00-09:20 | Conference Opening, Dias (7th floor) | |
09:20-10:20 | Keynote Talk 1, Dias (7th floor) | |
10:20-10:50 | Coffee break, 7th floor foyer | |
10:50-12:30 | Oral Session 1: Best Paper Session, Dias (7th floor) | |
12:30-13:30 | Lunch break, 7th floor foyer | |
13:30-15:10 | Oral Session 2A: Special Session 2 – MAPTA, Poseidon A+C (1st floor) | Oral Session 2B: 3D & VR, Dias (7th floor) |
15:10-16:00 | Coffee break, 7th floor foyer | Video Browser Showdown Setup, Dias (7th floor) |
16:00-19:00 | Video Browser Showdown & welcome reception, Dias (7th floor) | |
Conference Day 3, January 10th, 2019 | | |
---|---|---|
09:00-09:20 | Registration, 7th floor foyer | |
09:20-10:20 | Keynote Talk 2, Dias (7th floor) | |
10:20-10:50 | Coffee break, 7th floor foyer & 1st floor foyer | |
10:50-12:30 | Oral Session 3A: Special Session 1 – PDAL, Poseidon A+C (1st floor) | Oral Session 3B: MM Indexing and Mining, Dias (7th floor) |
12:30-13:30 | Lunch break, 7th floor foyer | |
13:30-15:10 | Oral Session 4A: Special Session 3 – MDRE, Poseidon A+C (1st floor) | Oral Session 4B: Deep Learning & Applications, Dias (7th floor) |
15:10-15:40 | Coffee break, 7th floor foyer | |
15:40-17:20 | Poster Session 1: Posters, Dias (7th floor) | |
17:45-20:00 | Walking tour (starting from the conference venue and ending at the conference dinner venue) | |
20:00-23:00 | Conference dinner, 'Emilios Riadis' hall, in the HELEXPO exhibition space (about a 30-minute walk from the conference venue) | |
Conference Day 4, January 11th, 2019 | | |
---|---|---|
09:00-09:20 | Registration, 7th floor foyer | |
09:20-10:20 | Keynote Talk 3, Dias (7th floor) | |
10:20-10:50 | Coffee break, 7th floor foyer & 1st floor foyer | |
10:50-12:30 | Oral Session 5A: Special Session 4 – CTA, Poseidon A+C (1st floor) | Oral Session 5B: Audio & Speech, Dias (7th floor) |
12:30-13:30 | Lunch break, 7th floor foyer | |
13:30-15:10 | Oral Session 6A: Special Session 5 – TCMA, Poseidon A+C (1st floor) | Oral Session 6B: Industry Session, Dias (7th floor) |
15:10-15:40 | Coffee break, 7th floor foyer | |
15:40-17:20 | Poster Session 2: Posters and Demos (VBS systems will also be demonstrated in this session), Dias (7th floor) | |
17:20-17:30 | Conference Closing | |
Detailed Program
Tuesday, January 8th, 2019 | |
---|---|
09:30-12:45 | Tutorial 1: Multimodal Deep Learning, by Prof. Xavier Giro-i-Nieto |
13:45-17:00 | Tutorial 2: New Trends of Simulation and Augmented Visualization in Medicine, by Prof. Lucio Tommaso De Paolis |
13:45-17:00 | VBS rehearsal and closed session |
09:30-16:40 | MANPU Workshop |
09:30-11:00 | MANPU Opening and Invited Talk |
09:30-09:40 | Opening |
09:40-10:40 | Invited Talk: Management of digital resources at the International City of Comics and the Image: projects and needs, by Jean-Philippe Martin |
10:40-11:00 | Q&A with invited speaker |
11:15-12:45 | MANPU Oral Session 1 |
11:15-11:45 | Rita Hartel and Alexander Dunst. "How good is good enough?" Establishing quality thresholds for the automatic text analysis of retro-digitized comics |
11:45-12:15 | Frédéric Rayar and Seiichi Uchida. Comic text detection using neural network approach |
12:15-12:45 | Miki Ueno. Structure Analysis on Common Plot in Four-Scene Comic Story Dataset |
13:45-15:15 | MANPU Oral Session 2 |
13:45-14:15 | Nhu Van Nguyen, Christophe Rigaud, and Jean-Christophe Burie. Multi-task model for comic book image analysis |
14:15-14:45 | Byeongseon Park and Mitsunori Matsushita. Estimating Comic Content from The Book Cover Information Using Fine-Tuned VGG Model |
14:45-15:15 | Jochen Laubrock and David Dubray. CNN-based Classification of Illustrator Style in Graphic Novels: Which Features Contribute Most? |
15:30-17:00 | MANPU Discussion and Closing Session |
15:30-16:30 | Discussion session |
16:30-16:40 | Closing |
Wednesday, January 9th, 2019 | |
---|---|
09:00-09:20 | Conference Opening |
09:20-10:20 | Keynote Talk 1: Prof. Daniel Gatica-Perez (Session Chair: Ioannis Kompatsiaris) |
10:50-12:30 | Oral Session 1: Best Paper Session (Session Chair: Cathal Gurrin) |
10:50-11:10 | Junyi Wang, Bing-Kun Bao, and Changsheng Xu. Sentiment-aware Multi-modal Recommendation on Tourist Attractions |
11:10-11:30 | Kai-jun Zhang, Cheng-Hao Guo, Zhong-Han Niu, Lu-Fei Liu, and Yu-Bin Yang. SCOD: Dynamical Spatial Constraints for Object Detection |
11:30-11:50 | Guang Chen, Yuexian Zou, and Can Zhang. STMP: Spatial Temporal Multi-level Proposal Network for Activity Detection |
11:50-12:10 | Junchao Zhang and Yuxin Peng. Hierarchical Vision-Language Alignment for Video Captioning |
12:10-12:30 | Alexander Kupin, Benjamin Moeller, Yijun Jiang, Natasha Kholgade Banerjee, and Sean Banerjee. Task-Driven Biometric Authentication of Users in Virtual Reality (VR) Environments |
13:30-15:10 | Oral Session 2A: Special Session 2 – MAPTA |
13:30-13:40 | Manuel Stein, Daniel Seebacher, Tassilo Karge, Tom Polk, Michael Grossniklaus, and Daniel A. Keim. From Movement to Events: Improving Soccer Match Annotations |
13:40-13:50 | Lyndon Nixon, Evlampios Apostolidis, Foteini Markatopoulou, Ioannis Patras, and Vasileios Mezaris. Multimodal Video Annotation for Retrieval and Discovery of Newsworthy Video in a News Verification Scenario |
13:50-14:00 | Snorri Gíslason, Björn Þór Jónsson, and Laurent Amsaleg. Integration of Exploration and Search: A Case Study of the M^3 Model |
14:00-14:10 | Werner Bailer. Face Swapping for Solving Collateral Privacy Issues in Multimedia Analytics |
14:10-14:20 | Alan F. Smeaton, Yvette Graham, Kevin McGuinness, Noel E. O’Connor, Seán Quinn, and Eric Arazo Sanchez. The Impact of Training Data Bias on Automatic Generation of Video Captions |
14:20-15:10 | Panel discussion |
13:30-15:10 | Oral Session 2B: 3D & VR (Session Chair: Wen-Huang Cheng) |
13:30-13:50 | Lingyun Yu, Jun Yu, and Qiang Ling. Deep Neural Network Based 3D Articulatory Movement Prediction Using Both Text and Audio Inputs |
13:50-14:10 | Kyriaki Christaki, Emmanouil Christakis, Petros Drakoulis, Alexandros Doumanoglou, Nikolaos Zioulis, Dimitrios Zarpalas, and Petros Daras. Subjective Visual Quality Assessment of Immersive 3D Media Compressed by Open-Source Static 3D Mesh Codecs |
14:10-14:30 | Kedong Liu, Yanwei Liu, Jinxia Liu, Antonios Argyriou, and Ying Ding. Joint EPC and RAN Caching of Tiled VR Videos for Mobile Networks |
14:30-14:50 | Adam Siekawa, Michał Chwesiuk, Radosław Mantiuk, and Rafał Piórkowski. Foveated Ray Tracing for VR Headsets |
14:50-15:10 | Marek Wernikowski, Radosław Mantiuk, and Rafał Piórkowski. Preferred Model of Adaptation to Dark for Virtual Reality Headsets |
16:00-19:00 | Video Browser Showdown |
Klaus Schoeffmann, Bernd Münzer, Andreas Leibetseder, Jürgen Primus, and Sabrina Kletz. Autopiloting Feature Maps: The Deep Interactive Video Exploration (diveXplore) System at VBS2019 | |
Giuseppe Amato, Paolo Bolettieri, Fabio Carrara, Franca Debole, Fabrizio Falchi, Claudio Gennaro, Lucia Vadicamo, and Claudio Vairo. VISIONE at VBS2019 | |
Jakub Lokoč, Gregor Kovalčík, Tomáš Souček, Jaroslav Moravec, Jan Bodnár, and Přemysl Čech. VIRET Tool Meets NasNet | |
Stelios Andreadis, Anastasia Moumtzidou, Damianos Galanopoulos, Foteini Markatopoulou, Konstantinos Apostolidis, Thanassis Mavropoulos, Ilias Gialampoukidis, Stefanos Vrochidis, Vasileios Mezaris, Ioannis Kompatsiaris, and Ioannis Patras. VERGE in VBS 2019 | |
Phuong Anh Nguyen, Chong-Wah Ngo, Danny Francis, and Benoit Huet. VIREO @ Video Browser Showdown 2019 | |
Luca Rossetto, Mahnaz Amiri Parian, Ralph Gasser, Ivan Giangreco, Silvan Heller, and Heiko Schuldt. Deep Learning-based Concept Detection in vitrivr | |
Thursday, January 10th, 2019 | |
---|---|
09:20-10:20 | Keynote Talk 2: Prof. Andreas Symeonidis (Session Chair: Benoit Huet) |
10:50-12:30 | Oral Session 3A: Special Session 1 – PDAL |
10:50-11:00 | Owen Corrigan and Suzanne Little. Fashion Police: Towards Semantic Indexing of Clothing Information In Surveillance Data |
11:00-11:10 | Yijun Jiang, Elim Schenck, Spencer Kranz, Sean Banerjee, and Natasha Kholgade Banerjee. CNN-Based Non-Contact Detection of Food Level in Bottles from RGB Images |
11:10-11:20 | Zhixiang Ji, Jie Tang, and Gangshan Wu. Personalized Recommendation of Photography based on Deep Learning |
11:20-11:30 | Xiaohua Wang, Muzi Peng, Lijuan Pan, Min Hu, Chunhua Jin, and Fuji Ren. Two-level Attention with Multi-task Learning for Facial Emotion Estimation |
11:30-11:40 | Aaron Duane and Cathal Gurrin. User Interaction for Visual Lifelog Retrieval in a Virtual Environment |
11:40-12:30 | Panel discussion |
10:50-12:30 | Oral Session 3B: MM Indexing and Mining (Session Chair: Stefanos Vrochidis) |
10:50-11:10 | Shuhei Tsuchida, Satoru Fukayama, and Masataka Goto. Query-by-Dancing: A Dance Music Retrieval System Based on Body-Motion Similarity |
11:10-11:30 | Xuelin Zhu, Biwei Cao, Shuai Xu, Bo Liu, and Jiuxin Cao. Joint Visual-Textual Sentiment Analysis Based on Cross-modality Attention Mechanism |
11:30-11:50 | Chang Zhou, Lai-Man Po, Mengyang Liu, Wilson Y.F. Yuen, Peter H. W. Wong, Hon-Tung Luk, Kin Wai Lau, and Hok Kwan Cheung. Deep Hashing with Triplet Labels and Unification Binary Code Selection for Fast Image Retrieval |
11:50-12:10 | Martin Winter and Werner Bailer. Incremental Training for Face Recognition |
12:10-12:30 | Ke Sun, Zhuo Lei, Jiasong Zhu, Xianxu Hou, Bozhi Liu, and Guoping Qiu. Character Prediction in TV Series via a Semantic Projection Network |
13:30-15:10 | Oral Session 4A: Special Session 3 – MDRE |
13:30-13:40 | Cathal Gurrin, Klaus Schoeffmann, Hideo Joho, Bernd Münzer, Rami Albatal, Frank Hopfgartner, Liting Zhou, and Duc-Tien Dang-Nguyen. A Test Collection for Interactive Lifelog Retrieval |
13:40-13:50 | Tomohiro Sato, Minh-Son Dao, Kota Kuribayashi, and Koji Zettsu. SEPHLA: Challenges and Opportunities Within Environment-Personal Health Archives |
13:50-14:00 | Theodoros Giannakopoulos, Margarita Orfanidi, and Stavros Perantonis. Athens Urban Soundscape (ATHUS): A Dataset for Urban Soundscape Quality Recognition |
14:00-14:10 | Luca Rossetto, Heiko Schuldt, George Awad, and Asad A. Butt. V3C - a Research Video Collection |
14:10-15:10 | Panel discussion |
13:30-15:10 | Oral Session 4B: Deep Learning & Applications (Session Chair: Tat-Seng Chua) |
13:30-13:50 | Minho Park, Hak Gu Kim, and Yong Man Ro. Photo-realistic Facial Emotion Synthesis using Multi-level Critic Networks with Multi-level Generative Model |
13:50-14:10 | Xierong Zhu, Jiawei Liu, Hongtao Xie, and Zheng-Jun Zha. Adaptive Alignment Network for Person Re-identification |
14:10-14:30 | Yongchao Xu, Qizheng Yang, Chaoran Cui, Cheng Shi, Guangle Song, Xiaohui Han, and Yilong Yin. Visual Urban Perception with Deep Semantic-Aware Network |
14:30-14:50 | Zhuopeng Li and Xiaoyan Zhang. Deep Reinforcement Learning for Automatic Thumbnail Generation |
14:50-15:10 | Yu-Chieh Chen, Daniel Stanley Tan, Wen-Huang Cheng, and Kai-Lung Hua. 3D Object Completion via Class-conditional Generative Adversarial Network |
15:40-17:20 | Poster Session 1: Posters (Session Chair: Phoebe Chen) |
Konstantinos Apostolidis and Vasileios Mezaris. Image Aesthetics Assessment using Fully Convolutional Neural Networks | |
Markos Zampoglou, Fotini Markatopoulou, Gregoire Mercier, Despoina Touska, Evlampios Apostolidis, Symeon Papadopoulos, Roger Cozien, Ioannis Patras, Vasileios Mezaris, and Ioannis Kompatsiaris. Detecting Tampered Videos with Multimedia Forensics and Deep Learning | |
Boubacar Diallo, Thierry Urruty, Pascal Bourdon, and Christine Fernandez-Maloigne. Improving Robustness of Image Tampering Detection for Compression | |
Patrice Guyot, Thierry Malon, Geoffrey Roman-Jimenez, Sylvie Chambon, Vincent Charvillat, Alain Crouzil, André Péninou, Julien Pinquier, Florence Sèdes, and Christine Sénac. Audiovisual Annotation Procedure for Multi-view Field Recordings | |
Nan Ran, Longteng Kong, Yunhong Wang, and Qingjie Liu. A Robust Multi-Athlete Tracking Algorithm by Exploiting Discriminant Features and Long-Term Dependencies | |
Marios Krestenitis, Georgios Orfanidis, Konstantinos Ioannidis, Konstantinos Avgerinakis, Stefanos Vrochidis, and Ioannis Kompatsiaris. Early Identification of Oil Spills in Satellite Images Using Deep CNNs | |
Xu Cao and Katashi Nagao. Point Cloud Colorization Based on Densely Annotated 3D Shape Dataset | |
Nikolaos Bastas, Theodoros Semertzidis, Apostolos Axenopoulos, and Petros Daras. evolve2vec: Learning Network Representations Using Temporal Unfolding | |
Dunja Vucic and Lea Skorin-Kapov. The Impact of Packet Loss and Google Congestion Control on QoE for WebRTC-based Mobile Multiparty Audiovisual Telemeetings | |
Can Zhang, Yuexian Zou, and Guang Chen. Hierarchical Temporal Pooling for Efficient Online Action Recognition | |
Xianyu Wu, Xiaojie Li, Jia He, Xi Wu, and Imran Mumtaz. Generative Adversarial Networks with Enhanced Symmetric Residual Units for Single Image Super-Resolution | |
Anastasia Ioannidou, Elisavet Chatzilari, Spiros Nikolopoulos, and Yiannis Kompatsiaris. 3D ResNets for 3D Object Classification | |
Xin Lai, Xirong Li, Rui Qian, Dayong Ding, Jun Wu, and Jieping Xu. Four Models for Automatic Recognition of Left and Right Eye in Fundus Images | |
Alexander Schindler and Andreas Rauber. On the unsolved problem of Shot Boundary Detection for Music Videos | |
Chao Liu, Yuexian Zou, and Dongming Yang. Enhancing Scene Text Detection via Fused Semantic Segmentation Network with Attention | |
Zhipeng Wu, Hui Tian, Xuzhen Zhu, Shaoshuai Fan, and Shuo Wang. Exploiting Incidence Relation Between Subgroups for Improving Clustering-Based Recommendation Model | |
Yirui Wu, Weigang Xu, Qinghan Yu, Jun Feng, and Tong Lu. Hierarchical Bayesian Network based Incremental Model for Flood Prediction | |
Dan Wang, Yun Sheng, and Guixu Zhang. A New Female Body Segmentation and Feature Localisation Method for Image-based Anthropometry | |
Ioannis Mademlis, Anastasios Tefas, and Ioannis Pitas. Greedy Salient Dictionary Learning For Activity Video Summarization | |
Jinzhong Lin, Junbiao Pang, Li Su, Yugui Liu, and Qingming Huang. Accelerating Topic Detection on Web for a Large-Scale Data Set via Stochastic Poisson Deconvolution | |
Siming Cui, Xuanjing Shen, and Yingda Lyu. Automatic Segmentation of Brain Tumor Image Based on Region Growing with Co-constraint | |
Nami Iino, Mayumi Shimada, Takuichi Nishimura, and Masatoshi Hamanaka. Proposal of an Annotation Method for Integrating Musical Technique Knowledge Using a GTTM Time-Span Tree | |
Wenliang Zeng and Ji Liu. A Hierarchical Level Set Approach to for RGBD Image Matting | |
Wei-Ta Chu and Hao-An Chu. A Genetic Programming Approach to Integrate Multilayer CNN Features for Image Classification | |
Madhumita A. Takalkar, Haimin Zhang, and Min Xu. Improving Micro-Expression Recognition Accuracy using Twofold Feature Extraction | |
Li Yao, Ya Lin, Chunbo Zhu, and Zuolong Wang. An Effective Dual-fisheye Lens Stitching Method based on Feature Points | |
Xin Liu and Guoying Zhao. 3D Skeletal Gesture Recognition via Sparse Coding of Time-Warping Invariant Riemannian Trajectories | |
Hengtong Hu, Richang Hong, Weijie Fu, and Meng Wang. Efficient Graph Based Multi-view Learning | |
Jesús Jorrín and Luis Buera. DANTE Speaker Recognition Module. An Efficient and Robust Automatic Speaker Searching Solution for Terrorism-related Scenarios | |
Friday, January 11th, 2019 | |
---|---|
09:20-10:20 | Keynote Talk 3: Prof. Martha Larson (Session Chair: Vasileios Mezaris) |
10:50-12:30 | Oral Session 5A: Special Session 4 – CTA |
10:50-11:05 | Luis Lebron Casas and Eugenia Koblents. Video Summarization with LSTM and Deep Attention Models |
11:05-11:20 | Jodie Gauvain, Lori Lamel, Viet Bac Le, Julien Despres, Jean-Luc Gauvain, Abdel Messaoudi, Bianca Vieru, and Waad Ben Kheder. Challenges in Audio Processing of Terrorist-related Data |
11:20-11:35 | George Kalpakis, Theodora Tsikrika, Stefanos Vrochidis, and Yiannis Kompatsiaris. Identifying Terrorism-related Key Actors in Multidimensional Social Networks |
11:35-11:50 | Alexander Schindler, Martin Boyer, Andrew Lindley, David Schreiber, and Thomas Philipp. Large Scale Audio-Visual Video Analytics Platform for Forensic Investigations of Terroristic Attacks |
11:50-12:05 | Andrea Ciapetti, Giulia Ruggiero, and Daniele Toti. A Semantic Knowledge Discovery Framework for Detecting Online Terrorist Networks |
12:05-12:20 | Konstantinos Gkountakos, Theodoros Semertzidis, Georgios Th. Papadopoulos, and Petros Daras. A Reliability Object Layer for Deep Hashing-based Visual Indexing |
10:50-12:30 | Oral Session 5B: Audio & Speech (Session Chair: Vasileios Mezaris) |
10:50-11:10 | Rui Zhang, Ruimin Hu, Gang Li, and Xiaochen Wang. Spectral Tilt Estimation for Speech Intelligibility Enhancement using RNN based on All-pole Model |
11:10-11:30 | Dading Chong, Yuexian Zou, and Wenwu Wang. Multi-Channel Convolutional Neural Networks with Multi-level Feature Fusion for Environmental Sound Classification |
11:30-11:50 | Hirofumi Takamori, Takayuki Nakatsuka, Satoru Fukayama, Masataka Goto, and Shigeo Morishima. Audio-Based Automatic Generation of a Piano Reduction Score by Considering the Musical Structure |
11:50-12:10 | Alfonso Perez-Carrillo. Violin Timbre Navigator: Real-time Visual Feedback of Violin Bowing based on Audio Analysis and Machine Learning |
12:10-12:30 | Odette Scharenborg, Nikki van der Gouw, Martha Larson, and Elena Marchiori. The Representation of Speech in Deep Neural Networks |
13:30-15:10 | Oral Session 6A: Special Session 5 – TCMA |
13:30-13:50 | Tairan Zhang, Congyan Lang, and Junliang Xing. Realtime Human Segmentation in Video |
13:50-14:10 | Chunyang Li, Caiyan Jia, Zhineng Chen, Xiaoyan Gu, and Hongyun Bao. psDirector: An Automatic Director for Watching View Generation from Panoramic Soccer Video |
14:10-14:30 | Li Su, Pamela Cosman, and Qihang Peng. No-Reference Video Quality Assessment Based on Ensemble of Knowledge and Data-Driven Models |
14:30-14:50 | Jiajie Dai and Simon Dixon. Understanding Intonation Trajectories and Patterns of Vocal Notes |
13:30-15:10 | Oral Session 6B: Industry Session (Session Chair: Eduard Vazquez) |
13:30-13:50 | Nudrat Nida, Muhammad Haroon Yousaf, Aun Irtaza, and Sergio A. Velastin. Bag of Deep Features for Instructor Activity Recognition in Lecture Room |
13:50-14:10 | Srijan Das, Monique Thonnat, Kaustubh Sakhalkar, Michal Koperski, Francois Bremond, and Gianpiero Francesca. A New Hybrid Architecture for Human Activity Recognition from RGB-D videos |
14:10-14:30 | Tom Durand, Xiyan He, Ionel Pop, and Lionel Robinault. Utilizing Deep Object Detector for Video Surveillance Indexing and Retrieval |
14:30-14:50 | Mehryar Emambakhsh, Alessandro Bay, and Eduard Vazquez. Deep Recurrent Neural Network for Multi-target Filtering |
14:50-15:10 | Renjie Xie, Yuancheng Wang, Tian Xie, Yuhao Zhang, Li Xu, Jian Lu, and Qiao Wang. Adversarial Training for Video Disentangled Representation |
15:40-17:20 | Poster Session 2: Posters and Demos (Session Chair: Yong Man Ro) (VBS systems will also be demonstrated in this session) |
Damianos Galanopoulos and Vasileios Mezaris. Temporal Lecture Video Fragmentation using Word Embeddings | |
Chaohao Lu and Yuexian Zou. Using Coarse Label Constraint for Fine-grained Visual Classification | |
Danny Francis, Benoit Huet, and Bernard Merialdo. Gated Recurrent Capsules for Visual Word Embeddings | |
Yisheng Yue, Palaiahnakote Shivakumara, Yirui Wu, Liping Zhu, Tong Lu, and Umapada Pal. An Automatic System for Generating Artificial Fake Character Images | |
Wenfeng Zhang, Zhiqiang Wei, Lei Huang, Jie Nie, Lei Lv, and Guanqun Wei. Person Re-Identification Based on Pose-aware Segmentation | |
Chih-Wei Lin and Qilu Ding. Neuropsychiatric Disorders Identification using Convolutional Neural Network | |
Efstratios Kakaletsis, Maria Tzelepi, Pantelis I. Kaplanoglou, Charalampos Symeonidis, Nikos Nikolaidis, Anastasios Tefas, and Ioannis Pitas. Semantic Map Annotation through UAV Video Analysis using Deep Learning Models in ROS | |
Minglei Yang, Yan Song, Xiangbo Shu, and Jinhui Tang. Temporal Action Localization Based on Temporal Evolution Model and Multiple Instance Learning | |
Jia-Li Tao, Jian-Ming Zhang, Liang-Jun Wang, Xiang-Jun Shen, and Zheng-Jun Zha. Near-duplicate Video Retrieval through Toeplitz Kernel Partial Least Squares | |
Hongyang Li, Jun Chen, Ruimin Hu, Mei Yu, Huafeng Chen, and Zengmin Xu. Action Recognition Using Visual Attention with Reinforcement Learning | |
Junqing Yu, Aiping Lei, and Yangliu Hu. Soccer Video Event Detection Based on Deep Learning | |
Jinna Lv and Bin Wu. Spatio-Temporal Attention Model Based on Multi-View for Social Relation Understanding | |
Ting Wu, Qing Xu, Yunhe Li, Yuejun Guo, and Klaus Schoeffmann. Detail-Preserving Trajectory Summarization Based on Segmentation and Group-Based Filtering | |
Fang Wen, Zehang Lin, Zhenguo Yang, and Wenyin Liu. Single-Stage Detector with Semantic Attention for Occluded Pedestrian Detection | |
Xian Zhong, Meng Feng, Wenxin Huang, Zheng Wang, and Shin’ichi Satoh. Poses Guide Spatiotemporal Model for Vehicle Re-identification | |
Jui-Yuan Su, Shyi-Chyi Cheng, Chin-Chun Chang, and Jun-Wei Hsieh. Alignment of Deep Features in 3D Models for Camera Pose Estimation | |
Wenzhe Wang, Bin Wu, Jinna Lv, and Pilin Dai. Regular and Small Target Detection | |
Yannick Le Cacheux, Hervé Le Borgne, and Michel Crucianu. From Classical to Generalized Zero-Shot Learning: a Simple Adaptation Process | |
Masayuki Tamura and Satoshi Nakamura. A Method for Enriching Video-watching Experience with Applied Effects Based on Eye Movements | |
Junki Saito and Satoshi Nakamura. Fontender: Interactive Japanese Text Design with Dynamic Font Fusion Method for Comics | |
Iacopo Vagliano, Angela Fessl, Franziska Guenther, Thomas Koehler, Vasileios Mezaris, Ahmed Saleh, Ansgar Scherp, and Ilija Simic. Training Researchers with the MOVING Platform | |
Kyriaki Christaki, Konstantinos C. Apostolakis, Alexandros Doumanoglou, Nikolaos Zioulis, Dimitrios Zarpalas, and Petros Daras. Space Wars: An AugmentedVR Game | |
Bernd Münzer, Andreas Leibetseder, Sabrina Kletz, and Klaus Schöffmann. ECAT - Endoscopic Concept Annotation Tool | |
Juan Soler-Company and Leo Wanner. Automatic Classification and Linguistic Analysis of Extremist Online Material | |