Abstract is missing.
- Wearable Camera Based Food Logging SystemKenshiro Sato, Yoko Yamakata, Sosuke Amano, Kiyoharu Aizawa. [doi]
- Human-Avatar Interaction in Metaverse: Framework for Full-Body InteractionKit-Yung Lam, Liang Yang, Ahmad Alhilal, Lik Hang Lee, Gareth Tyson, Pan Hui 0001. [doi]
- Robust Learning with Adversarial Perturbations and Label Noise: A Two-Pronged Defense ApproachPeng-fei Zhang, Zi Huang, Xin Luo, Pengfei Zhao. [doi]
- Emotional Talking Faces: Making Videos More Expressive and RealisticSahil Goyal, Shagun Uppal, Sarthak Bhagat, Dhroov Goel, Sakshat Mali, Yi Yu 0001, Yifang Yin, Rajiv Ratn Shah. [doi]
- Deep Image and Kernel Prior Learning for Blind Super-ResolutionKazuhiro Yamawaki, Xian-Hua Han. [doi]
- On the Robustness of 3D Object DetectorsFatima Albreiki, Sultan Abu Ghazal, Jean Lahoud, Rao Muhammad Anwer, Hisham Cholakkal, Fahad Shahbaz Khan. [doi]
- Towards High Performance One-Stage Human Pose EstimationLing Li, Lin Zhao, Linhao Xu, Jie Xu. [doi]
- Two-Layer Learning-Based P-Frame Coding with Super-Resolution and Content-Adaptive Conditional ANFDavid Alexandre, Hsueh-Ming Hang, Wen-Hsiao Peng. [doi]
- TFM a Dataset for Detection and Recognition of Masked Faces in the WildGibran Benitez-Garcia, Hiroki Takahashi, Miguel Jimenez-Martinez, Jesus Olivares-Mercado. [doi]
- Informative Sample-Aware Proxy for Deep Metric LearningAoyu Li, Ikuro Sato, Kohta Ishikawa, Rei Kawakami, Rio Yokota. [doi]
- A Music Loop Sequencer with User-Adaptive Music Loop SelectionYuki Iwamoto, Tetsuro Kitahara. [doi]
- Self-Attentive CLIP Hashing for Unsupervised Cross-Modal RetrievalHeng Yu, Shuyan Ding, LunBo Li, Jiexin Wu. [doi]
- Action Detection System Based on Pose InformationRyo Kawai, Noboru Yoshida, Jianquan Liu. [doi]
- DeepHair: A DeepFake-Based Hairstyle Preview SystemYu-Hsuan Lo, Shih-Wei Sun. [doi]
- A Reality Check of Positioning in Multiuser Mobile Augmented Reality: Measurement and AnalysisNa Wang, Haoliang Wang, Stefano Petrangeli, Viswanathan Swaminathan, Fei Li 0001, Songqing Chen. [doi]
- FoodLog Athl: Multimedia Food Recording Platform for Dietary Guidance and Food MonitoringKei Nakamoto, Kohei Kumazawa, Hiroaki Karasawa, Sosuke Amano, Yoko Yamakata, Kiyoharu Aizawa. [doi]
- Multispectral Image Denoising via Structural Tensor Sparsity Promoting ModelLonglu Huang, Na Qi, Qing Zhu. [doi]
- Singing Voice Detection via Similarity-Based Semi-Supervised LearningXi Chen, Yongwei Gao, Wei Li 0012. [doi]
- Multimodal Fusion with Cross-Modal Attention for Action Recognition in Still ImagesJia-Hua Tsai, Wei-Ta Chu. [doi]
- 360BroadView: Viewer Management for Viewport Prediction in 360-Degree Video Live BroadcastQian Zhou, Zhe Yang, Hongpeng Guo, Beitong Tian, Klara Nahrstedt. [doi]
- A Multimodal Sensor Fusion Framework Robust to Missing Modalities for Person RecognitionVijay John, Yasutomo Kawanishi. [doi]
- CMR3D: Contextualized Multi-Stage Refinement for 3D Object DetectionDhanalaxmi Gaddam, Jean Lahoud, Fahad Shahbaz Khan, Rao Muhammad Anwer, Hisham Cholakkal. [doi]
- SLGAN: Style- and Latent-Guided Generative Adversarial Network for Desirable Makeup Transfer and RemovalDaichi Horita, Kiyoharu Aizawa. [doi]
- GSTH266enc: A GStreamer Plugin for VVC EncoderAdvaiit Rajjvaed, Saurabh Puri, Gurdeep Bhullar, Gaëlle Martin-Cocher. [doi]
- Rubber Material Retrieval System using Electron Microscope Images for Rubber Material DevelopmentRintaro Yanagi, Ren Togo, Takahiro Ogawa 0001, Miki Haseyama. [doi]
- Enhancing the Robustness of Deep Learning Based Fingerprinting to Improve Deepfake AttributionChieh-Yin Liao, Chen-Hsiu Huang, Jun-Cheng Chen, Ja-Ling Wu. [doi]
- Remote Sensing Image Colorization Based on Joint Stream Deep Convolutional Generative Adversarial NetworksJingyu Wang, Jie Nie, Hao Chen, Huaxin Xie, Chengyu Zheng, Min Ye, Zhiqiang Wei 0002. [doi]
- Image Compression for Machines Using Boundary-Enhanced SaliencyYuanyuan Xu, Haolun Lan. [doi]
- Popularity-Aware Graph Social Recommendation for Fully Non-Interaction UsersNozomu Onodera, Keisuke Maeda, Takahiro Ogawa 0001, Miki Haseyama. [doi]
- Wider or Deeper Neural Network Architecture for Acoustic Scene Classification with Mismatched Recording DevicesLam Pham, Khoa Tran, Dat Ngo, Hieu Tang, Son Phan, Alexander Schindler. [doi]
- ObjectMix: Data Augmentation by Copy-Pasting Objects in Videos for Action RecognitionJun Kimata, Tomoya Nitta, Toru Tamaki. [doi]
- Affective Embedding Framework with Semantic Representations from Tweets for Zero-Shot Visual Sentiment PredictionYingrui Ye, Yuya Moroto, Keisuke Maeda, Takahiro Ogawa 0001, Miki Haseyama. [doi]
- Graph Neural Network Based Living Comfort Prediction Using Real Estate Floor Plan ImagesRyota Kitabayashi, Taro Narahara, Toshihiko Yamasaki. [doi]
- Multi-Scale Channel Transformer Network for Single Image DerainingYuto Namba, Xian-Hua Han. [doi]
- Deep Weighted Guided Upsampling Network for Depth of Field Image UpsamplingLanling Zeng, Lianxiong Wu, Yang Yang 0046, Xiangjun Shen, Yongzhao Zhan. [doi]
- JamSketch Deep α: A CNN-Based Improvisation System in Accordance with User's Melodic Outline DrawingTetsuro Kitahara, Akio Yonamine. [doi]
- Deep Enhancement-Object Features Fusion for Low-Light Object DetectionWan Teng Lim, Kelvin Ang, Yuen Peng Loh. [doi]
- Learned Bi-Directional Motion Prediction for Video CompressionYunhui Shi, Shaopei An, Jin Wang, Baocai Yin. [doi]
- Asymmetric Label Propagation for Video Object SegmentationZhen Chen, Ming Yang 0007, Shiliang Zhang. [doi]
- Sequential Frame-Interpolation and DCT-based Video Compression FrameworkYeganeh Jalalpour, Wu-chi Feng, Feng Liu. [doi]
- Parallel Queries for Human-Object Interaction DetectionJunwen Chen, Keiji Yanai. [doi]
- An End-to-End Scene Text Detector with Dynamic AttentionJingyu Lin, Yan Yan, Hanzi Wang. [doi]
- Zero-Shot Font Style Transfer with a Differentiable RendererKota Izumi, Keiji Yanai. [doi]
- Intelligent Video Surveillance Platform Based on FFmpeg and Yolov5Chuanxu Jiang, Yanfang Wang, Qian Huang, Yiming Wang, Yuhan Dai. [doi]
- Disentangled Image Attribute Editing in Latent Space via Mask-Based Retention LossShunya Ohaga, Ren Togo, Takahiro Ogawa 0001, Miki Haseyama. [doi]
- Federated Knowledge Transfer for Heterogeneous Visual ModelsWenzhe Li, Zirui Zhu, Tianchi Huang, Lifeng Sun, Chun Yuan. [doi]
- SPEAKER VGG CCT: Cross-Corpus Speech Emotion Recognition with Speaker Embedding and Vision TransformersAlessandro Arezzo, Stefano Berretti. [doi]