Abstract is missing.
- Generalizable Solar Irradiation Prediction using Large Transformer Models with Sky ImageryKuber Reddy Gorantla, Aditi Roy. 1-5 [doi]
- Towards Achieving Lightweight Deep Neural Network for Precision Agriculture with Maize Disease DetectionCarlos Victorino Padeiro, Takahiro Komamizu, Ichiro Ide. 1-6 [doi]
- QAHOI: Query-Based Anchors for Human-Object Interaction DetectionJunwen Chen, Keiji Yanai. 1-5 [doi]
- Ensemble Fusion for Small Object DetectionHao-Yu Hou, Mu-Yi Shen, Chia-Chi Hsu, En-Ming Huang, Yu-Chen Huang, Yu-Cheng Xia, Chien-Yao Wang, Chun-Yi Lee. 1-6 [doi]
- Automated Identification of Surgical Instruments without Tagging: Implementation in Real Hospital Work EnvironmentRui Ishiyama, Per Helge Litzheim Frøiland, Stein-Asle Øvrebotn. 1-4 [doi]
- Generalization of pixel-wise phase estimation by CNN and improvement of phase-unwrapping by MRF optimization for one-shot 3D scanHiroto Harada, Michihiro Mikamo, Ryo Furukawa 0001, Ryusuke Sagawa, Hiroshi Kawasaki. 1-5 [doi]
- Mixed Distillation for Unsupervised Anomaly DetectionFuzhen Cai, Siyu Xia. 1-5 [doi]
- Malware detection using Kernel Constrained Subspace MethodDjafer Yahia, Messaoud Benchadi, Bojan Batalo, Kazuhiro Fukui. 1-5 [doi]
- Joint learning of images and videos with a single Vision TransformerShuki Shimizu, Toru Tamaki. 1-6 [doi]
- Deep Randomized Time Warping for Action RecognitionYutaro Hiraoka, Kazuhiro Fukui. 1-5 [doi]
- QaQ: Robust 6D Pose Estimation via Quality-Assessed RGB-D FusionThéo Petitjean, Zongwei Wu, Olivier Laligant, Cédric Demonceaux. 1-7 [doi]
- Uncertainty Criteria in Active Transfer Learning for Efficient Video-Specific Human Pose EstimationHiromu Taketsugu, Norimichi Ukita. 1-5 [doi]
- Human Pose Prediction by Progressive Generation in Multi-scale Frequency DomainTomohiro Fujita, Yasutomo Kawanishi. 1-5 [doi]
- MFFPN: an Anchor-Free Method for Patent Drawing Object DetectionYu-Hsien Chen, Chih-Yi Chiu. 1-5 [doi]
- Joint Learning with Group Relation and Individual ActionChihiro Nakatani, Hiroaki Kawashima, Norimichi Ukita. 1-6 [doi]
- Outline Generation Transformer for Bilingual Scene Text RecognitionJui-Teng Ho, Gee-Sern Hsu, Svetlana N. Yanushkevich, Marina L. Gavrilova. 1-5 [doi]
- LOTS: Litter On The Sand dataset for litter segmentationPaola Barra, Alessia Auriemma Citarella, Giosuè Orefice, Modesto Castrillón Santana, Angelo Ciaramella. 1-4 [doi]
- TinyPedSeg: A Tiny Pedestrian Segmentation Benchmark for Top-Down Drone ImagesYusuf Huseyin Sahin, Elvin Abdinli, M. Arda Aydin, Gozde Unal. 1-5 [doi]
- Low-Level Feature Aggregation Networks for Disease Severity Estimation of Coffee LeavesTakuhiro Okada, Yuantian Huang, Guoqing Hao, Satoshi Iizuka, Kazuhiro Fukui. 1-5 [doi]
- Leveraging Embedding Information to Create Video Capsule Endoscopy DatasetsPere Gilabert, Carolina Malagelada, Hagen Wenzek, Jordi Vitrià, Santi Seguí. 1-5 [doi]
- Image Impression Estimation by Clustering People with Similar TastesBanri Kojima, Takahiro Komamizu, Yasutomo Kawanishi, Keisuke Doman, Ichiro Ide. 1-5 [doi]
- Cross-modal Manifold Cutmix for Self-supervised Video Representation LearningSrijan Das, Michael S. Ryoo. 1-6 [doi]
- Multi-Plane Projection for Extending Perspective Image Object Detection Models to 360° ImagesYasuto Nagase, Yasunori Babazaki, Katsuhiko Takahashi. 1-5 [doi]
- YOLOv5 with Mixed Backbone for Efficient Spatio-Temporal Hand Gesture Localization and RecognitionLuis Acevedo-Bringas, Gibran Benitez-Garcia, Jesus Olivares-Mercado, Hiroki Takahashi. 1-5 [doi]
- MVA2023 Small Object Detection Challenge for Spotting Birds: Dataset, Methods, and ResultsYuki Kondo, Norimichi Ukita, Takayuki Yamaguchi, Hao-Yu Hou, Mu-Yi Shen, Chia-Chi Hsu, En-Ming Huang, Yu-Chen Huang, Yu-Cheng Xia, Chien-Yao Wang, Chun-Yi Lee, Da Huo, Marc A. Kastner 0001, Tingwei Liu, Yasutomo Kawanishi, Takatsugu Hirayama, Takahiro Komamizu, Ichiro Ide, Yosuke Shinya, Xinyao Liu, Guang Liang, Syusuke Yasui. 1-11 [doi]
- Using Unconditional Diffusion Models in Level Generation for Super Mario BrosHyeon Joon Lee, Edgar Simo-Serra. 1-5 [doi]
- Panoptic Segmentation of Galactic Structures in LSB ImagesFelix Richards, Adeline Paiement, Xianghua Xie, Elisabeth Sola, Pierre-Alain Duc. 1-6 [doi]
- Automatic Reconstruction of Semantic 3D Models from 2D Floor PlansAleixo Cambeiro Barreiro, Mariusz Trzeciakiewicz, Anna Hilsmann, Peter Eisert. 1-5 [doi]
- Combining Static Specular Flow and Highlight with Deep Features for Specular Surface DetectionHirotaka Hachiya, Yuto Yoshimura. 1-5 [doi]
- Safe height estimation of deformable objects for picking robots by detecting multiple potential contact pointsJaeSung Yang, Daisuke Hagihara, Kiyoto Ito, Nobuhiro Chihara. 1-5 [doi]
- Bottleneck Transformer model with Channel Self-Attention for skin lesion classificationMasato Tada, Xian-Hua Han. 1-5 [doi]
- BandRe: Rethinking Band-Pass Filters for Scale-Wise Object Detection EvaluationYosuke Shinya. 1-5 [doi]
- Combining Knowledge Distillation and Transfer Learning for Sensor Fusion in Visible and Thermal Camera-based Person ClassificationVijay John, Yasutomo Kawanishi. 1-5 [doi]
- Diabetic Retinopathy Grading based on a Sparse Network Fusion of Heterogeneous ConvNeXt Models with Category AttentionAgustin Castillo-Munguia, Gibran Benitez-Garcia, Jesus Olivares-Mercado, Hiroki Takahashi. 1-5 [doi]
- Hierarchical Spatio-Temporal Neural Network with Displacement Based Refinement for Monocular Head Pose PredictionZhe Xu, Yuan Li, Yuhong Li, Songlin Du, Takeshi Ikenaga. 1-5 [doi]
- MS-VACSNet: A Network for Multi-scale Volcanic Ash Cloud Segmentation in Remote Sensing ImagesG. Swetha, Rajeshreddy Datla, Chalavadi Vishnu, Krishna Mohan C. 1-6 [doi]
- An X3D Neural Network Analysis for Runner's Performance Assessment in a Wild Sporting EnvironmentDavid Freire-Obregón, Javier Lorenzo-Navarro, Oliverio J. Santana, Daniel Hernández-Sosa, Modesto Castrillón Santana. 1-5 [doi]
- Multi-Prior Based Multi-Scale Condition Network for Single-Image HDR ReconstructionHaorong Jiang, Fengshan Zhao, Junda Liao, Qin Liu 0002, Takeshi Ikenaga. 1-5 [doi]
- Padding Investigations for CNNs in Scene Parsing TasksYu-Hui Huang, Marc Proesmans, Luc Van Gool. 1-5 [doi]
- Age Prediction From Face Images Via Contrastive LearningYeongnam Chae, Poulami Raha, Mijung Kim, Björn Stenger. 1-6 [doi]
- Transformer with Task Selection for Continual LearningSheng-Kai Huang, Chun-Rong Huang. 1-5 [doi]
- Interpreting Art by Leveraging Pre-Trained ModelsNiklas Penzel, Joachim Denzler. 1-6 [doi]
- PALF: Pre-Annotation and Camera-LiDAR Late Fusion for the Easy Annotation of Point CloudsYucheng Zhang, Masaki Fukuda, Yasunori Ishii, Kyoko Ohshima, Takayoshi Yamashita. 1-5 [doi]
- Lifelong Change Detection: Continuous Domain Adaptation for Small Object Change Detection in Everyday Robot NavigationKoji Takeda, Kanji Tanaka, Yoshimasa Nakamura. 1-5 [doi]
- Contrastive Knowledge Distillation for Anomaly Detection in Multi-Illumination/Focus Display ImagesJihyun Lee, Hangil Park, Yongmin Seo, Taewon Min, Joodong Yun, Jaewon Kim, Tae-Kyun Kim. 1-5 [doi]
- Can you read lips with a masked face?Taiki Arakane, Chihiro Kai, Takeshi Saitoh. 1-5 [doi]
- Unsupervised Fall Detection on Edge DevicesTakuya Nakabayashi, Hideo Saito. 1-5 [doi]
- CG-based dataset generation and adversarial image conversion for deep cucumber recognitionHiroaki Masuzawa, Chuo Nakano, Jun Miura. 1-5 [doi]
- Hardware-Aware Zero-Shot Neural Architecture SearchYutaka Yoshihama, Kenichi Yadani, Shota Isobe. 1-5 [doi]
- Investigating self-supervised learning for Skin Lesion ClassificationTakumi Morita, Xian-Hua Han. 1-5 [doi]
- Enhancing Retail Product Recognition: Fine-Grained Bottle Size ClassificationKatarina Tolja, Marko Subasic, Zoran Kalafatic, Sven Loncaric. 1-5 [doi]
- ViTVO: Vision Transformer based Visual Odometry with Attention SupervisionChu-Chi Chiu, Hsuan-Kung Yang, Hao-Wei Chen, Yu-Wen Chen, Chun-Yi Lee. 1-5 [doi]
- ASD-EVNet: An Ensemble Vision Network based on Facial Expression for Autism Spectrum Disorder RecognitionAssil Jaby, Md Baharul Islam, Md Atiqur Rahman Ahad. 1-5 [doi]
- Monocular Blind Spot Estimation with Occupancy Grid MappingKazuya Odagiri, Kazunori Onoguchi. 1-6 [doi]
- Multi-class Semantic Segmentation of Tooth Pathologies and Anatomical Structures on Bitewing and Periapical RadiographsJames-Andrew Sarmiento, Liushifeng Chen, Prospero C. Naval Jr.. 1-5 [doi]
- Object Detection for Embedded Systems Using Tiny Spiking Neural Networks: Filtering Noise Through Visual AttentionHugo Bulzomi, Amélie Gruel, Jean Martinet, Takeshi Fujita, Yuta Nakano, Rémy Bendahan. 1-5 [doi]
- Black-box Adversarial Attack against Visual Interpreters for Deep Neural NetworksYudai Hirose, Satoshi Ono. 1-6 [doi]
- Video Anomaly Detection Using Encoder-Decoder Networks with Video Vision Transformer and Channel Attention BlocksShimpei Kobayashi, Akiyoshi Hizukuri, Ryohei Nakayama. 1-4 [doi]
- *Marija Ivanovska, Vitomir Struc, Janez Pers. 1-6 [doi]
- Safe Landing Zone Detection for UAVs using Image Segmentation and Super ResolutionAnagh Benjwal, Prajwal Uday, Aditya Vadduri, Abhishek Pai. 1-6 [doi]
- Tackling Face Verification Edge Cases: In-Depth Analysis and Human-Machine Fusion ApproachMartin Knoche, Gerhard Rigoll. 1-5 [doi]
- Grid Sample Based Temporal Iteration and Compactness-coefficient Distance for High Frame and Ultra-low Delay SLIC Segmentation SystemYuan Li, Tingting Hu, Ryuji Fuchikami, Takeshi Ikenaga. 1-5 [doi]
- Word to Sentence Visual Semantic Similarity for Caption Generation: Lessons LearnedAhmed Sabir. 1-5 [doi]
- Small Object Detection for Birds with Swin TransformerDa Huo, Marc A. Kastner 0001, Tingwei Liu, Yasutomo Kawanishi, Takatsugu Hirayama, Takahiro Komamizu, Ichiro Ide. 1-5 [doi]
- Shape Preservation in Image Style Transfer for Gaze EstimationDaiki Mushiake, Kentaro Otomo, Chihiro Nakatani, Norimichi Ukita. 1-5 [doi]
- Dynamic Transfer for Domain Adaptation in Crowd CountingShekhor Chanda, Yang Wang 0003. 1-5 [doi]
- Domain Adaptation from Visible-Light to FIR with Reliable Pseudo LabelsJuki Tanimoto, Haruya Kyutoku, Keisuke Doman, Yoshito Mekada. 1-5 [doi]
- A Hybrid Wheat Head Detection model with Incorporated CNN and TransformerSho Harada, Xian-Hua Han. 1-5 [doi]
- Quadruped Robot Platform for Selective Pesticide SprayingHansen Hendra, Yubin Liu, Ryoichi Ishikawa, Takeshi Oishi, Yoshihiro Sato. 1-6 [doi]
- Self-Supervised Pre-Training Boosts Semantic Scene Segmentation on LiDAR dataMariona Carós, Ariadna Just, Santi Seguí, Jordi Vitrià. 1-6 [doi]
- Weakly-Supervised Deep Image Hashing based on Cross-Modal TransformerChing-Ching Yang, Wei-Ta Chu, Shiv Ram Dubey. 1-5 [doi]
- Intra-frame Skeleton Constraints Modeling and Grouping Strategy Based Multi-Scale Graph Convolution Network for 3D Human Motion PredictionZhihan Zhuang, Yuan Li, Songlin Du, Takeshi Ikenaga. 1-5 [doi]