Abstract is missing.
- Summary Embedded Deep Learning Object Detection Model CompetitionJiun-In Guo, Chia-Chi Tsai, Yong-Hsiang Yang, Hung-Wei Lin, Bo-Xun Wu, Ted T. Kuo, Li-Jen Wang. 1-5 [doi]
- Super-resolution of Omnidirectional Images Using Adversarial LearningCagri Ozcinar, Aakanksha Rana, Aljosa Smolic. 1-6 [doi]
- Spatiotemporal Modeling and Label Distribution Learning for Video SummarizationWei-Ta Chu, Yu-Hsin Liu. 1-6 [doi]
- Multi-label Few-shot Learning for Sound Event RecognitionKai-Hsiang Cheng, Szu-Yu Chou, Yi-Hsuan Yang. 1-5 [doi]
- Semantic Segmentation in Compressed VideosAng Li, Yiwei Lu, Yang Wang 0003. 1-5 [doi]
- On the accuracy of video quality measurement techniquesDeepthi Nandakumar, Yongjun Wu, Hai Wei, Avisar Ten-Ami. 1-6 [doi]
- Hand-hygiene activity recognition in egocentric videoChengzhang Zhong, Amy R. Reibman, Hansel Mina Cordoba, Amanda J. Deering. 1-6 [doi]
- Predicting Biological Signals from Speech: Introducing a Novel Multimodal Dataset and ResultsAlice Baird, Shahin Amiriparian, Miriam Berschneider, Maximilian Schmitt, Björn W. Schuller. 1-5 [doi]
- Thermal Facial Landmark Detection by Deep Multi-Task LearningWei-Ta Chu, Yu-Hui Liu. 1-6 [doi]
- EVS and OPUS Audio Coders Performance Evaluation for Oriental and Orchestral Musical InstrumentsYasser A. Zenhom, Eman Mohammed, Micheal N. Mikhael, Hala A. Mansour. 1-6 [doi]
- Predicting Subjectivity in Image Aesthetics AssessmentChen Kang, Giuseppe Valenzise, Frédéric Dufaux. 1-6 [doi]
- Can Deep Generative Audio be Emotional? Towards an Approach for Personalised Emotional Audio GenerationAlice Baird, Shahin Amiriparian, Björn W. Schuller. 1-5 [doi]
- A Secure Visual-thermal Fused Face Recognition System Based on Non-Linear HashingXing-Bo Dong, KokSheik Wong, Zhe Jin, Jean-Luc Dugelay. 1-6 [doi]
- Visual Navigation of Large Image GraphsNico Hezel, Kai-Uwe Barthel, Konstantin Schall, Klaus Jung. 1 [doi]
- Accurate small bowel lesions detection in wireless capsule endoscopy images using deep recurrent attention neural networkRémi Vallée, Astrid de Maissin, Antoine Coutrot, Nicolas Normand, Arnaud Bourreille, Harold Mouchère. 1-5 [doi]
- Deep Metric Learning using Similarities from Nonlinear Rank ApproximationsKonstantin Schall, Kai-Uwe Barthel, Nico Hezel, Klaus Jung. 1-6 [doi]
- On the usage of visual saliency models for computer generated objectsMona Abid, Matthieu Perreira Da Silva, Patrick Le Callet. 1-5 [doi]
- Injective State-Image Mapping facilitates Visual Adversarial Imitation LearningSubhajit Chaudhury, Daiki Kimura, Asim Munawar, Ryuki Tachibana. 1-6 [doi]
- An Efficient Logo Insertion Method for Video Coding in HEVCYunchang Li, Zhijie Huang, Jun Sun 0012. 1-5 [doi]
- Monocular Camera Target Detection and LocationBin Fu, Baiquan Zhao, Yang Cheng. 1-3 [doi]
- Multi-Class Lane Semantic Segmentation using Efficient Convolutional NetworksShao-Yuan Lo, Hsueh-Ming Hang, Sheng-Wei Chan, Jing-Jhih Lin. 1-6 [doi]
- Improving speech intelligibility using microphones on behind the ear hearing aidsYusuke Hioka, Kei Kobayashi, Kenta Niwa. 1-4 [doi]
- Study on user quitting rate for adaptive bitrate video streamingPierre R. Lebreton, Kazuhisa Yamagishi. 1-6 [doi]
- LiteEmo: Lightweight Deep Neural Networks for Image Emotion RecognitionYan-Han Chew, Lai-Kuan Wong, John See, Huai-Qian Khor, Balasubramanian Abivishaq. 1-6 [doi]
- A Robust GSC Beamforming Method for Speech Enhancement using Linear Microphone ArrayFeng Ni, Yi Zhou, Hongqing Liu. 1-5 [doi]
- Photo Filter Classification and Filter Recommendation without Much Manual LabelingWei-Ta Chu, Yu-Tzu Fan. 1-6 [doi]
- Lowering Dynamic Power of a Stream-based CNN Hardware AcceleratorDuvindu Piyasena, Rukshan Wickramasinghe, Debdeep Paul, Siew Kei Lam, Meiqing Wu. 1-6 [doi]
- Generative Networks for Synthesizing Human Videos in Text-Defined OutfitsAkshay Malhotra, Viswanathan Swaminathan, Gang Wu, Ioannis D. Schizas. 1-6 [doi]
- An Occlusion Probability Model for Improving the Rendering Quality of ViewsChangjian Zhu, Hong Zhang, Yan Liu, Hongtao Su, Qiuming Liu. 1-5 [doi]
- Data-Dependent Ensemble of Magnitude Spectrum Predictions for Single Channel Speech EnhancementPasi Pertilä. 1-5 [doi]
- Blink-former: Light-aided beamforming for multiple targets enhancementDaiki Horiike, Robin Scheibler, Yukoh Wakabayashi, Nobutaka Ono. 1-6 [doi]
- 3D Facial Expression Recognition Based on Multi-View and Prior Knowledge FusionQuang Nhat Vo, Khanh Tran, Guoying Zhao. 1-6 [doi]
- Comparison of Subjective Quality Test Methods for Omnidirectional Video Quality EvaluationAshutosh Singla, Werner Robitza, Alexander Raake. 1-6 [doi]
- Atrial Fibrillation Detection using Different Duration ECG Signals with SE-ResNetJinjing Zhu, Yue Zhang, Qingqing Zhao. 1-5 [doi]
- Deep Aggregation of Regional Convolutional Activations for Content Based Image RetrievalKonstantin Schall, Kai-Uwe Barthel, Nico Hezel, Klaus Jung. 1-6 [doi]
- Decoder Side Motion Vector Refinement for Versatile Video CodingHan Gao, Semih Esenlik, Zhijie Zhao, Eckehard G. Steinbach, Jianle Chen. 1-6 [doi]
- IMMVP: An Efficient Daytime and Nighttime On-Road Object DetectorCheng-En Wu, Yi-Ming Chan, Chien-Hung Chen, Wen-Cheng Chen, Chu-Song Chen. 1-5 [doi]
- 3D Point Cloud Color Denoising Using Convex Graph-Signal Smoothness PriorsChinthaka Dinesh, Gene Cheung, Ivan V. Bajic. 1-6 [doi]
- One-Shot Video Object Segmentation Using Attention TransferOmit Chanda, Yang Wang. 1-6 [doi]
- Investigation of domain adaptation for acoustic frog species classificationJie Xie, Mingying Zhu. 1-6 [doi]
- Rewritable Data Embedding in JPEG XT Image using Coefficient Count Across LayersJeffrey Ting, KokSheik Wong, Simying Ong. 1-6 [doi]
- NCTU-GTAV360: A 360° Action Recognition Video DatasetSandy Ardianto, Hsueh-Ming Hang. 1-5 [doi]
- Vehicle Positioning and Ranging with Static Traffic Camera based on 2D-3D Tracking and Re-ProjectionZhan Song, Yipeng Liu, Yiling Xu, Le Yang, Quan Li. 1-6 [doi]
- Automatic Detection of Incorrect Location Images Uploaded by UsersHsu-Yung Cheng, Chih-Chang Yu, Hsiang-Yuan Liu, Sih-Ying Chen. 1-5 [doi]
- Insect interaction analysis based on object detection and CNNPaul Tresson, Philippe Tixier, William Puech, Dominique Carval. 1-6 [doi]
- Dynamic Guidance for Depth Map RestorationRan Zhu, Shengju Yu, Xiaoyu Xu, Li Yu. 1-6 [doi]
- Distortion scalable learned image compressionRenam C. da Silva, Vanessa Testoni. 1-6 [doi]
- MRNet: A Competition model for MMSP on Embedded Deep Learning Object DetectionBin Li, Yuyu Chen, Wenfeng Xue, Jiaqi Chen, Zun Weng, Fen Xiao. 1-5 [doi]
- Lightweight Deep Convolutional Neural Networks for Facial Expression RecognitionYanan Wang, Jianming Wu, Keiichiro Hoashi. 1-6 [doi]
- A Quality of Experience Evaluation Comparing Augmented Reality and Paper Based Instruction for Complex Task AssistanceEoghan Hynes, Ronan Flynn, Brian Lee, Niall Murray. 1-6 [doi]
- End-to-End Conditional GAN-based Architectures for Image ColourisationMarc Górriz Blanch, Marta Mrak, Alan F. Smeaton, Noel E. O'Connor. 1-6 [doi]
- Discrete Cosine Basis Oriented Motion Modeling for Fisheye and 360 Degree Video CodingAshek Ahmmed, Manoranjan Paul. 1-5 [doi]
- Scalable 360 Video Streaming using HTTP/2Duc V. Nguyen, Hoang Van Trung, Hoang Le Dieu Huong, Truong Thu Huong, Nam Pham Ngoc 0001, Truong Cong Thang. 1-6 [doi]
- On Data Wastage in Viewport-Dependent StreamingDmitrii Monakhov, Igor D. D. Curcio, Sujeet Mate. 1-6 [doi]
- Improved Patch Packing for the MPEG V-PCC StandardAfonso Costa, Antoine Dricot, Catarina Brites, João Ascenso, Fernando Pereira 0001. 1-6 [doi]
- Subjective Evaluation of 360-degree Sensory ExperiencesÁlan L. V. Guedes, Roberto Gerson De Albuquerque Azevedo, Pascal Frossard, Sérgio Colcher, Simone Diniz Junqueira Barbosa. 1-6 [doi]
- Selective Hearing: A Machine Listening PerspectiveEstefanía Cano, Hanna M. Lukashevich. 1-6 [doi]
- Multi-Label Classification for Automatic Human Blastocyst Grading with Severely Imbalanced DataLisette Lockhart, Parvaneh Saeedi, Jason Au, Jon Havelock. 1-6 [doi]
- Complexity Reduction Opportunities in the Future VVC Intra EncoderA. Tissier, Alexandre Mercat, Thomas Amestoy, Wassim Hamidouche, Jarno Vanne, Daniel Ménard. 1-6 [doi]
- Learning mappings onto regularized latent spaces for biometric authenticationMatteo Testa, Arslan Ali, Tiziano Bianchi, Enrico Magli. 1-6 [doi]
- Securing physical documents with digital signaturesChristian Winter 0001, Waldemar Berchtold, Jan Niklas Hollenbeck. 1-6 [doi]
- Tile-Based Joint Caching and Delivery of 360° Videos in Heterogeneous NetworksPantelis Maniotis, Eirina Bourtsoulatze, Nikolaos Thomos. 1-6 [doi]
- Quantitative Measurement of VR Stereoscopic Video Recording Quality Based on Visual Acuity LossParham Aarabi, Tzu-An Chen, Vladislav Il'govskiy, Anastasia Kolesnikov, Nathaniel Xu, Benzakhar Manashirov. 1-4 [doi]
- YouTube UGC Dataset for Video Compression ResearchYilin Wang, Sasi Inguva, Balu Adsumilli. 1-5 [doi]
- Luminance-based video backdoor attack against anti-spoofing rebroadcast detectionAbhir Bhalerao, Kassem Kallas, Benedetta Tondi, Mauro Barni. 1-6 [doi]
- Data Hiding in Perceptually Masked OpenEXR ImageKaiLin Chia, KokSheik Wong, Jean-Luc Dugelay. 1-6 [doi]
- Adaptive Multi-level Triangle Soup for Geometry-based Point Cloud CodingAntoine Dricot, João Ascenso. 1-6 [doi]
- Improved Vertex Skinning Algorithm Based On Dual QuaternionsHao Yin, Ramakarishnan Mukundan. 1-6 [doi]
- Coherent Crowd Analysis in Still ImageNurul Japar, Chee Seng Chan, Ven Jyn Kok. 1-6 [doi]
- Millimeter Wave meets Edge Computing for Mobile VR with High-Fidelity 8K Scalable 360° VideoSabyasachi Gupta, Jacob Chakareski, Petar Popovski. 1-6 [doi]
- High Precision Target Positioning Method for RSU in Cooperative PerceptionTuopu Wen, Zhongyang Xiao, Kun Jiang, Mengmeng Yang, Keqiang Li, Diange Yang. 1-6 [doi]
- Learning Multiple Sound Source 2D LocalizationGuillaume Le Moing, Phongtharin Vinayavekhin, Tadanobu Inoue, Jayakorn Vongkulbhisal, Asim Munawar, Ryuki Tachibana, Don Joven Agravante. 1-6 [doi]
- Edge Cloud-based Augmented RealityChristoph Bachhuber, Alvaro Sanchez Martinez, Rastin Pries, Sebastian Eger, Eckehard G. Steinbach. 1-6 [doi]
- Better Word Representations with Word WeightGege Song, Xianglin Huang, Gang Cao, Zhulin Tao, Wei Liu, Lifang Yang. 1-5 [doi]
- Incorporating Non-local and Task-specific Features for Instance SegmentationLongrong Yang, Fanman Meng, Qingbo Wu, Hongliang Li. 1-6 [doi]
- Virtual Fakes: DeepFakes for Virtual RealityAvishek Joey Bose, Parham Aarabi. 1 [doi]
- From Speech to Facial Activity: Towards Cross-modal Sequence-to-Sequence Attention NetworksLukas Stappen, Vincent Karas, Nicholas Cummins, Fabien Ringeval, Klaus R. Scherer, Björn W. Schuller. 1-6 [doi]