- Coughtrigger: Earbuds IMU Based Cough Detection Activator Using An Energy-Efficient Sensitivity-Prioritized Time Series Classifier. Shibo Zhang, Ebrahim Nemati, Minh Dinh, Nathan Folkman, Tousif Ahmed, Md. Mahbubur Rahman, Jilong Kuang, Nabil Alshurafa, Alex Gao 0001. 1-5
- Adversarial Examples Detection Based on Error Level Analysis and Space Mapping. Sizhao Huang, Shuai Wang, Jian Chen, Guozhi Li, Wenyi Wang. 1-5
- MetricBERT: Text Representation Learning Via Self-Supervised Triplet Training. Itzik Malkiel, Dvir Ginzburg, Oren Barkan, Avi Caciularu, Yoni Weill, Noam Koenigstein. 1-5
- Global Optimization Solution for Dynamic Adaptive 360-Degree Streaming. Xuekai Wei, Mingliang Zhou, Weijia Jia 0001. 1-5
- Non-Invasive Blood Pressure Monitoring with Multi-Modal In-Ear Sensing. Hoang Truong, Alessandro Montanari, Fahim Kawsar. 6-10
- Intelligent Wi-Fi Based Child Presence Detection System. Xiaolu Zeng, Beibei Wang 0001, Chenshu Wu, Sai Deepika Regani, K. J. Ray Liu. 11-15
- Real-Time Fall Detection Using mmWave Radar. Wenxuan Li, Dongheng Zhang, Yadong Li, Zhi Wu, Jinbo Chen, Dong Zhang, Yang Hu, Qibin Sun, Yan Chen. 16-20
- Hierarchical Deep Learning Model with Inertial and Physiological Sensors Fusion for Wearable-Based Human Activity Recognition. Dae-Yon Hwang, Pai Chet Ng, Yuanhao Yu, Yang Wang, Petros Spachos, Dimitrios Hatzinakos, Konstantinos N. Plataniotis. 21-25
- Speech Recovery For Real-World Self-Powered Intermittent Devices. Yu-Chen Lin, Tsun-An Hsieh, Kuo-Hsuan Hung, Cheng Yu, Harinath Garudadri, Yu Tsao 0001, Tei-Wei Kuo. 26-30
- Phase Control of Parametric Array Loudspeaker by Optimizing Sideband Weights. Ai Okano, Yoshinobu Kajikawa. 31-35
- Low-Latency Human-Computer Auditory Interface Based on Real-Time Vision Analysis. Florian Scalvini, Camille Bordeau, Maxime Ambard, Cyrille Migniot, Julien Dubois. 36-40
- Robust Adaptive Noise Canceller Algorithm with SNR-Based Stepsize Control and Noise-Path Gain Compensation. Akihiko Sugiyama. 41-45
- Neartracker: Acoustic 2-D Target Tracking with Nearby Reflector in SISO System. Chao Liu, Linlin Gao, Ruobing Jiang. 46-50
- An Efficient Method For Generic DSP Implementation Of Dilated Convolution. Harinarayanan. E. V, Sachin Ghanekar. 51-55
- Compression-Aware Projection with Greedy Dimension Reduction for Convolutional Neural Network Activations. Yu-Shan Tai, Chieh-Fang Teng, Cheng-Yang Chang, An-Yeu Andy Wu. 56-60
- Optimizing The Consumption Of Spiking Neural Networks With Activity Regularization. Simon Narduzzi, Siavash Arjomand Bigdeli, Shih-Chii Liu, L. Andrea Dunbar. 61-65
- IMPQ: Reduced Complexity Neural Networks Via Granular Precision Assignment. Sujan Kumar Gonugondla, Naresh R. Shanbhag. 66-70
- Rate Coding Or Direct Coding: Which One Is Better For Accurate, Robust, And Energy-Efficient Spiking Neural Networks? Youngeun Kim, Hyoungseob Park, Abhishek Moitra, Abhiroop Bhattacharjee, Yeshwanth Venkatesha, Priyadarshini Panda. 71-75
- PYXIS: An Open-Source Performance Dataset Of Sparse Accelerators. Linghao Song, Yuze Chi, Jason Cong. 76-80
- Fast Fault Diagnosis Method Of Rolling Bearings In Multi-Sensor Measurement Enviroment. Zuozhou Pan, Zhiping Lin, Yuanjin Zheng, Zong Meng. 81-85
- Detecting Anomaly in Chemical Sensors via Regularized Contrastive Learning. Diaa Badawi, Ishaan Bassi, Sule Ozev, Ahmet Enis Çetin. 86-90
- Evolutionary Neural Architecture Design of Liquid State Machine for Image Classification. Cheng Tang, Junkai Ji, Qiuzhen Lin, Yan Zhou. 91-95
- Invisible and Efficient Backdoor Attacks for Compressed Deep Neural Networks. Huy Phan, Yi Xie 0001, Jian Liu, Yingying Chen, Bo Yuan. 96-100
- Tensor-Based Orthogonal Matching Pursuit with Phase Rotation for Channel Estimation In Hybrid Beamforming MIMO-OFDM Systems. Cheng-Hung Lo, Pei-Yun Tsai. 101-105
- Spain-Net: Spatially-Informed Stereophonic Music Source Separation. Darius Petermann, Minje Kim. 106-110
- Improved Singing Voice Separation with Chromagram-Based Pitch-Aware Remixing. Siyuan Yuan, Zhepei Wang, Umut Isik, Ritwik Giri, Jean-Marc Valin, Michael M. Goodwin, Arvindh Krishnaswamy. 111-115
- Don't Separate, Learn To Remix: End-To-End Neural Remixing With Joint Optimization. Haici Yang, Shivani Firodiya, Nicholas J. Bryan, Minje Kim. 116-120
- Few-Shot Musical Source Separation. Yu Wang 0105, Daniel Stoller, Rachel M. Bittner, Juan Pablo Bello. 121-125
- Source Separation By Steering Pretrained Music Models. Ethan Manilow, Patrick O'Reilly, Prem Seetharaman, Bryan Pardo. 126-130
- Infant Crying Detection In Real-World Environments. Xuewen Yao, Megan Micheletti, Mckensey Johnson, Edison Thomaz, Kaya de Barbaro. 131-135
- Wikitag: Wikipedia-Based Knowledge Embeddings Towards Improved Acoustic Event Classification. Qin Zhang, Qingming Tang, Chieh-Chi Kao, Ming Sun 0007, Yang Liu, Chao Wang 0018. 136-140
- Urban Sound & Sight: Dataset And Benchmark For Audio-Visual Urban Scene Understanding. Magdalena Fuentes, Bea Steers, Pablo Zinemanas, Martín Rocamora, Luca Bondi, Julia Wilkins, Qianyi Shi, Yao Hou, Samarjit Das, Xavier Serra, Juan Pablo Bello. 141-145
- Real-World On-Board UAV Audio Data Set For Propeller Anomalies. Sai Srinadhu Katta, Kide Vuojärvi, Sivaprasad Nandyala, Ulla-Maria Kovalainen, Lauren Baddeley. 146-150
- Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition. Yuan Gong, Jin Yu, James R. Glass. 151-155
- Wearable SELD Dataset: Dataset For Sound Event Localization And Detection Using Wearable Devices Around Head. Kento Nagatomo, Masahiro Yasuda, Kohei Yatabe, Shoichiro Saito, Yasuhiro Oikawa. 156-160
- Tunet: A Block-Online Bandwidth Extension Model Based On Transformers And Self-Supervised Pretraining. Viet Anh Nguyen, Anh H. T. Nguyen, Andy W. H. Khong. 161-165
- DRC-NET: Densely Connected Recurrent Convolutional Neural Network for Speech Dereverberation. Jinjiang Liu, Xueliang Zhang. 166-170
- Customizable End-To-End Optimization Of Online Neural Network-Supported Dereverberation For Hearing Devices. Jean-Marie Lemercier, Joachim Thiemann, Raphael Koning, Timo Gerkmann. 171-175
- Importance of Switch Optimization Criterion in Switching WPE Dereverberation. Naoyuki Kamo, Rintaro Ikeshita, Keisuke Kinoshita, Tomohiro Nakatani. 176-180
- Audio-To-Symbolic Arrangement Via Cross-Modal Music Representation Learning. Ziyu Wang 0008, Dejing Xu, Gus Xia, Ying Shan. 181-185
- Music Phrase Inpainting Using Long-Term Representation and Contrastive Loss. Shiqi Wei, Gus Xia, Yixiao Zhang, Liwei Lin, Weiguo Gao. 186-190
- Melons: Generating Melody With Long-Term Structure Using Transformers And Structure Graph. Yi Zou, Pei Zou, Yi Zhao, Kaixiang Zhang, Ran Zhang, Xiaorui Wang. 191-195
- Difficulty-Aware Neural Band-to-Piano Score Arrangement based on Note- and Statistic-Level Criteria. Moyu Terao, Yuki Hiramatsu, Ryoto Ishizuka, Yiming Wu, Kazuyoshi Yoshii. 196-200
- Score Difficulty Analysis for Piano Performance Education based on Fingering. Pedro Ramoneda, Nazif Can Tamer, Vsevolod Eremenko, Xavier Serra, Marius Miron. 201-205
- A Neural Network-based Howling Detection Method for Real-Time Communication Applications. Zhipeng Chen, Yiya Hao, Yaobin Chen, Gong Chen, Liang Ruan. 206-210
- Alarm Sound Detection Using Topological Signal Processing. Tomer Fireaizen, Saar Ron, Omer Bobrowski. 211-215
- A Method For Estimating The Grouping Of Participants In Classroom Group Work Using Only Audio Information. Osamu Ichikawa, Yuuto Shima, Takahiro Nakayama, Hajime Shirouzu. 216-220
- Environmental Sound Extraction Using Onomatopoeic Words. Yuki Okamoto, Shota Horiguchi, Masaaki Yamamoto, Keisuke Imoto, Yohei Kawaguchi. 221-225
- Echo-Aware Adaptation of Sound Event Localization and Detection in Unknown Environments. Masahiro Yasuda, Yasunori Ohishi, Shoichiro Saito. 226-230
- On Adversarial Robustness Of Large-Scale Audio Visual Learning. Juncheng B. Li, Shuhui Qu, Xinjian Li, Bernie Po-Yao Huang, Florian Metze. 231-235
- Adversarial Sample Detection for Speaker Verification by Neural Vocoders. Haibin Wu, Po-Chun Hsu, Ji Gao, Shanshan Zhang, Shen Huang, Jian Kang 0006, Zhiyong Wu 0001, Helen Meng, Hung-yi Lee. 236-240
- Amicable Examples for Informed Source Separation. Naoya Takahashi, Yuki Mitsufuji. 241-245
- Multi-Modal Pre-Training for Automated Speech Recognition. David M. Chan, Shalini Ghosh, Debmalya Chakrabarty, Björn Hoffmeister. 246-250
- Speaker-Targeted Audio-Visual Speech Recognition Using a Hybrid CTC/Attention Model with Interference Loss. Ryota Tsunoda, Ryo Aihara, Ryoichi Takashima, Tetsuya Takiguchi, Yoshie Imai. 251-255
- Time-Domain Audio-Visual Speech Separation on Low Quality Videos. Yifei Wu, Chenda Li, Jinfeng Bai, Zhongqin Wu, Yanmin Qian. 256-260
- Complex-Valued Spatial Autoencoders for Multichannel Speech Enhancement. Mhd Modar Halimeh, Walter Kellermann. 261-265
- Multichannel Noise Reduction Using Dilated Multichannel U-Net and Pre-Trained Single-Channel Network. Zhi-Wei Tan, Anh H. T. Nguyen, Yuan Liu, Andy W. H. Khong. 266-270
- One Model to Enhance Them All: Array Geometry Agnostic Multi-Channel Personalized Speech Enhancement. Hassan Taherian, Sefik Emre Eskimez, Takuya Yoshioka, Huaming Wang, Zhuo Chen 0006, Xuedong Huang 0001. 271-275
- Multi-Channel Speech Denoising for Machine Ears. Cong Han, Emine Merve Kaya, Kyle Hoefer, Malcolm Slaney, Simon Carlile. 276-280
- Localization based Sequential Grouping for Continuous Speech Separation. Zhong-qiu Wang, DeLiang Wang. 281-285
- Convolutional Weighted Minimum Mean Square Error Filter for Joint Source Separation and Dereverberation. Mieszko Fras, Marcin Witkowski, Konrad Kowalczyk. 286-290
- Improving Source Separation by Explicitly Modeling Dependencies between Sources. Ethan Manilow, Curtis Hawthorne, Cheng-Zhi Anna Huang, Bryan Pardo, Jesse H. Engel. 291-295
- Music Source Separation With Deep Equilibrium Models. Yuichiro Koyama, Naoki Murata, Stefan Uhlich, Giorgio Fabbro, Shusuke Takahashi, Yuki Mitsufuji. 296-300
- Harmonic and Percussive Sound Separation Based on Mixed Partial Derivative of Phase Spectrogram. Natsuki Akaishi, Kohei Yatabe, Yasuhiro Oikawa. 301-305
- On Loss Functions and Evaluation Metrics for Music Source Separation. Enric Gusó, Jordi Pons, Santiago Pascual, Joan Serrà. 306-310
- Time-Balanced Focal Loss for Audio Event Detection. Sangwook Park, Mounya Elhilali. 311-315
- Multi-ACCDOA: Localizing And Detecting Overlapping Sounds From The Same Class With Auxiliary Duplicating Permutation Invariant Training. Kazuki Shimada, Yuichiro Koyama, Shusuke Takahashi, Naoya Takahashi, Emiru Tsunoo, Yuki Mitsufuji. 316-320
- Improved Representation Learning For Acoustic Event Classification Using Tree-Structured Ontology. Arman Zharmagambetov, Qingming Tang, Chieh-Chi Kao, Qin Zhang, Ming Sun 0007, Viktor Rozgic, Jasha Droppo, Chao Wang 0018. 321-325
- Temporal Contrastive-Loss for Audio Event Detection. Sandeep Kothinti, Mounya Elhilali. 326-330
- A Frame Loss of Multiple Instance Learning for Weakly Supervised Sound Event Detection. Xu Wang, Xiangjinzi Zhang, Yunfei Zi, Shengwu Xiong. 331-335
- Pseudo Strong Labels for Large Scale Weakly Supervised Audio Tagging. Heinrich Dinkel, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Yujun Wang. 336-340
- Individualized Hear-Through For Acoustic Transparency Using PCA-Based Sound Pressure Estimation At The Eardrum. Wenyu Jin 0002, Tim Schoof, Henning F. Schepker. 341-345
- On Spectral and Temporal Sparsification of Speech Signals for the Improvement of Speech Perception in CI Listeners. Benjamin Lentz, Rainer Martin 0001, Kirsten Oberländer, Christiane Völter. 346-350
- A Differentiable Optimisation Framework for The Design of Individualised DNN-based Hearing-Aid Strategies. Fotios Drakopoulos, Sarah Verhulst. 351-355
- Personalized Speech Enhancement: New Models and Comprehensive Evaluation. Sefik Emre Eskimez, Takuya Yoshioka, Huaming Wang, Xiaofei Wang, Zhuo Chen 0006, Xuedong Huang 0001. 356-360
- Dynamic Sliding Window for Realtime Denoising Networks. Jinxu Xiang, Yuyang Zhu, Rundi Wu, Ruilin Xu 0001, Yuko Ishiwaka, Changxi Zheng. 361-365
- Bloom-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement. Sunwoo Kim 0003, Minje Kim. 366-370
- HGCN: Harmonic Gated Compensation Network for Speech Enhancement. Tianrui Wang, Weibin Zhu, Yingying Gao, Junlan Feng, Shilei Zhang. 371-375
- Speech Enhancement with Neural Homomorphic Synthesis. Wenbin Jiang, Zhijun Liu, Kai Yu, Fei Wen. 376-380
- A Bayesian Permutation Training Deep Representation Learning Method for Speech Enhancement with Variational Autoencoder. Yang Xiang, Jesper Lisby Højvang, Morten Højfeldt Rasmussen, Mads Græsbøll Christensen. 381-385
- Integrating Statistical Uncertainty into Neural Network-Based Speech Enhancement. Huajian Fang, Tal Peer, Stefan Wermter, Timo Gerkmann. 386-390
- Unsupervised Speech Enhancement with Speech Recognition Embedding and Disentanglement Losses. Viet Anh Trinh, Sebastian Braun. 391-395
- Musicyolo: A Sight-Singing Onset/Offset Detection Framework Based on Object Detection Instead of Spectrum Frames. Xianke Wang, Wei Xu, Weiming Yang, Wenqing Cheng. 396-400
- Modeling Beats and Downbeats with a Time-Frequency Transformer. Yun-Ning Hung, Ju-Chiang Wang, Xuchen Song, Wei Tsung Lu, Minz Won. 401-405
- Hierarchical Classification of Singing Activity, Gender, and Type in Complex Music Recordings. Michael Krause 0002, Meinard Müller. 406-410
- Deepchorus: A Hybrid Model of Multi-Scale Convolution And Self-Attention for Chorus Detection. Qiqi He, Xiaoheng Sun, Yi Yu 0001, Wei Li 0012. 411-415
- To Catch A Chorus, Verse, Intro, or Anything Else: Analyzing a Song with Structural Functions. Ju-Chiang Wang, Yun-Ning Hung, Jordan B. L. Smith. 416-420
- A Novel 1D State Space for Efficient Music Rhythmic Analysis. Mojtaba Heydari, Matthew McCallum, Andreas Ehmann, Zhiyao Duan. 421-425
- Upmixing Via Style Transfer: A Variational Autoencoder for Disentangling Spatial Images And Musical Content. Haici Yang, Sanna Wager, Spencer Russell, Mike Luo, Minje Kim, Wontak Kim. 426-430
- Spatial Mixup: Directional Loudness Modification as Data Augmentation for Sound Event Localization and Detection. Ricardo Falcón Pérez, Kazuki Shimada, Yuichiro Koyama, Shusuke Takahashi, Yuki Mitsufuji. 431-435
- Towards Faster Continuous Multi-Channel HRTF Measurements Based On Learning System Models. Tobias Kabzinski, Peter Jax. 436-440
- Towards Fast And Convenient End-To-End HRTF Personalization. Bowen Zhi, Dmitry N. Zotkin, Ramani Duraiswami. 441-445
- Wishart Localization Prior On Spatial Covariance Matrix In Ambisonic Source Separation Using Non-Negative Tensor Factorization. Mateusz Guzik, Konrad Kowalczyk. 446-450
- Improving Lyrics Alignment Through Joint Pitch Detection. Jiawen Huang, Emmanouil Benetos, Sebastian Ewert. 451-455
- Learning Music Audio Representations Via Weak Language Supervision. Ilaria Manco, Emmanouil Benetos, Elio Quinton, György Fazekas. 456-460
- On the Prediction of the Frequency Response of a Wooden Plate from Its Mechanical Parameters. David Giuseppe Badiane, Raffaele Malvermi, Sebastian Gonzalez, Fabio Antonacci, Augusto Sarti. 461-465
- Automatic DJ Transitions with Differentiable Audio Effects and Generative Adversarial Networks. Bo-Yu Chen, Wei-Han Hsu, Wei-Hsiang Liao, Marco A. Martínez Ramírez, Yuki Mitsufuji, Yi-Hsuan Yang. 466-470
- Self-Supervised Representation Learning for Unsupervised Anomalous Sound Detection Under Domain Shift. Han Chen, Yan Song, Li-Rong Dai 0001, Ian McLoughlin 0001, Lin Liu. 471-475
- Federated Self-Training for Data-Efficient Audio Recognition. Vasileios Tsouvalas, Aaqib Saeed, Tanir Ozcelebi. 476-480
- Federated Self-Supervised Learning for Acoustic Event Classification. Meng Feng, Chieh-Chi Kao, Qingming Tang, Ming Sun 0007, Viktor Rozgic, Spyros Matsoukas, Chao Wang 0018. 481-485
- Temporal Knowledge Distillation for on-device Audio Classification. KwangHee Choi, Martin Kersner, Jacob Morton, Buru Chang. 486-490
- Streaming on-Device Detection of Device Directed Speech from Voice and Touch-Based Invocation. Ognjen (Oggi) Rudovic, Akanksha Bindal, Vineet Garg, Pramod Simha, Pranay Dighe, Sachin Kajarekar. 491-495
- Multi-Frame Full-Rank Spatial Covariance Analysis for Underdetermined BSS in Reverberant Environments. Hiroshi Sawada, Rintaro Ikeshita, Keisuke Kinoshita, Tomohiro Nakatani. 496-500
- Flow-Based Fast Multichannel Nonnegative Matrix Factorization for Blind Source Separation. Aditya Arie Nugraha, Kouhei Sekiguchi, Mathieu Fontaine 0002, Yoshiaki Bando, Kazuyoshi Yoshii. 501-505
- Harvesting Partially-Disjoint Time-Frequency Information for Improving Degenerate Unmixing Estimation Technique. Yudong He, He Wang, Qifeng Chen, Richard H. Y. So. 506-510
- Investigation And Comparison of Optimization Methods for Variational Autoencoder-Based Underdetermined Multichannel Source Separation. Shogo Seki, Hirokazu Kameoka, Li Li 0063. 511-515
- HBP: An Efficient Block Permutation Solver Using Hungarian Algorithm and Spectrogram Inpainting for Multichannel Audio Source Separation. Li Li 0063, Hirokazu Kameoka, Shogo Seki. 516-520
- EAD-Conformer: a Conformer-Based Encoder-Attention-Decoder-Network for Multi-Task Audio Source Separation. Chenxing Li, Yang Wang, Feng Deng, Zhuo Zhang, Xiaorui Wang, Zhongyuan Wang. 521-525
- The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World Soundtracks. Darius Petermann, Gordon Wichern, Zhong-qiu Wang, Jonathan Le Roux. 526-530
- Phase Shifted Bedrosian Filterbank: An Interpretable Audio Front-End for Time-Domain Audio Source Separation. Félix Mathieu, Thomas Courtat, Gaël Richard, Geoffroy Peeters. 531-535
- Harmonicity Plays a Critical Role in DNN Based Versus in Biologically-Inspired Monaural Speech Segregation Systems. Rahil Parikh, Ilya Kavalerov, Carol Y. Espy-Wilson, Shihab Shamma. 536-540
- Multi-Channel Narrow-Band Deep Speech Separation with Full-Band Permutation Invariant Training. Changsheng Quan, Xiaofei Li. 541-545
- Csenet: Complex Squeeze-and-Excitation Network for Speech Depression Level Prediction. Cunhang Fan, Zhao Lv, Shengbing Pei, Mingyue Niu. 546-550
- Ubilung: Multi-Modal Passive-Based Lung Health Assessment. Ebrahim Nemati, Xuhai Xu, Viswam Nathan, Korosh Vatanparvar, Tousif Ahmed, Md. Mahbubur Rahman, Daniel McCaffrey, Jilong Kuang, Alex Gao 0001. 551-555
- The Second DiCOVA Challenge: Dataset and Performance Analysis for Diagnosis of COVID-19 Using Acoustics. Neeraj Kumar Sharma, Srikanth Raj Chetupalli, Debarpan Bhattacharya, Debottam Dutta, Pravin Mote, Sriram Ganapathy. 556-560
- Supervised and Self-Supervised Pretraining Based COVID-19 Detection Using Acoustic Breathing/Cough/Speech Signals. Xing-yu Chen, Qiu-Shi Zhu, Jie Zhang, Li-Rong Dai 0001. 561-565
- Exploring Auditory Acoustic Features for The Diagnosis of COVID-19. Madhu R. Kamble, Jose Patino 0001, Maria A. Zuluaga, Massimiliano Todisco. 566-570
- Fast-Rir: Fast Neural Diffuse Room Impulse Response Generator. Anton Ratnarajah, Shi-Xiong Zhang, Meng Yu 0003, Zhenyu Tang 0001, Dinesh Manocha, Dong Yu 0001. 571-575
- Region-to-Region Kernel Interpolation of Acoustic Transfer Function with Directional Weighting. Juliano G. C. Ribeiro, Shoichi Koyama, Hiroshi Saruwatari. 576-580
- Blind Reverberation Time Estimation in Dynamic Acoustic Conditions. Philipp Götz, Cagdas Tuna, Andreas Walther, Emanuël A. P. Habets. 581-585
- Sparse Modeling of The Early Part of Noisy Room Impulse Responses with Sparse Bayesian Learning. Maozhong Fu, Jesper Rindom Jensen, Yuhan Li, Mads Græsbøll Christensen. 586-590
- Improved Simulation of Realistically-Spatialised Simultaneous Speech Using Multi-Camera Analysis in The CHiME-5 Dataset. Jack Deadman, Jon Barker. 591-595
- A Data-Driven Approach for Acoustic Parameter Similarity Estimation of Speech Recording. Mattia Papa, Clara Borrelli, Paolo Bestagini, Fabio Antonacci, Augusto Sarti, Stefano Tubaro. 596-600
- Violinist Identification Using Note-Level Timbre Feature Distributions. Yudong Zhao, György Fazekas, Mark B. Sandler. 601-605
- S3T: Self-Supervised Pre-Training with Swin Transformer For Music Classification. Hang Zhao, Chen Zhang, Bilei Zhu, Zejun Ma, Kejun Zhang. 606-610
- Ambiguity Modelling with Label Distribution Learning for Music Classification. Morgan Buisson, Pablo Alonso-Jiménez, Dmitry Bogdanov. 611-615
- Bytecover2: Towards Dimensionality Reduction of Latent Embedding for Efficient Cover Song Identification. Xingjian Du, Ke Chen 0021, Zijie Wang, Bilei Zhu, Zejun Ma. 616-620
- Tonet: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music. Ke Chen 0021, Shuai Yu, Cheng-i Wang, Wei Li, Taylor Berg-Kirkpatrick, Shlomo Dubnov. 621-625
- Hierarchical Graph-Based Neural Network for Singing Melody Extraction. Shuai Yu, Xi Chen, Wei Li. 626-630
- On The Impact of Normalization Strategies in Unsupervised Adversarial Domain Adaptation for Acoustic Scene Classification. Michel Olvera, Emmanuel Vincent 0001, Gilles Gasso. 631-635
- Improving Bird Classification with Unsupervised Sound Separation. Tom Denton, Scott Wisdom, John R. Hershey. 636-640
- Scalable Neural Architectures for End-to-End Environmental Sound Classification. Francesco Paissan, Alberto Ancilotto, Alessio Brutti, Elisabetta Farella. 641-645
- HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection. Ke Chen 0021, Xingjian Du, Bilei Zhu, Zejun Ma, Taylor Berg-Kirkpatrick, Shlomo Dubnov. 646-650
- Hybrid Attention-Based Prototypical Networks for Few-Shot Sound Classification. You Wang, David V. Anderson. 651-655
- End-to-End Complex-Valued Multidilated Convolutional Neural Network for Joint Acoustic Echo Cancellation and Noise Suppression. Karn N. Watcharasupat, Thi Ngoc Tho Nguyen, Woon-Seng Gan, Shengkui Zhao, Bin Ma. 656-660
- NN3A: Neural Network Supported Acoustic Echo Cancellation, Noise Suppression and Automatic Gain Control for Real-Time Communications. Ziteng Wang, Yueyue Na, Biao Tian, Qiang Fu 0001. 661-665
- Deep Residual Echo Suppression and Noise Reduction: A Multi-Input FCRN Approach in a Hybrid Speech Enhancement System. Jan Franzen, Tim Fingscheidt. 666-670
- Neural Cascade Architecture for Joint Acoustic Echo and Noise Suppression. Hao Zhang, DeLiang Wang. 671-675
- Cascade Multi-Channel Noise Reduction and Acoustic Feedback Cancellation. Santiago Ruiz, Toon van Waterschoot, Marc Moonen. 676-680
- Skim: Skipping Memory LSTM for Low-Latency Real-Time Continuous Speech Separation. Chenda Li, Lei Yang, Weiqin Wang, Yanmin Qian. 681-685
- Adapting Speech Separation to Real-World Meetings using Mixture Invariant Training. Aswin Sivaraman, Scott Wisdom, Hakan Erdogan, John R. Hershey. 686-690
- Quantifying Discriminability between NMF Bases. Eisuke Konno, Daisuke Saito, Nobuaki Minematsu. 691-695
- Location-Based Training for Multi-Channel Talker-Independent Speaker Separation. Hassan Taherian, Ke Tan 0001, DeLiang Wang. 696-700
- SDR - Medium Rare with Fast Computations. Robin Scheibler. 701-705
- Attentionpit: Soft Permutation Invariant Training for Audio Source Separation with Attention Mechanism. Hirokazu Kameoka, Shogo Seki, Li Li, Chihiro Watanabe. 706-710
- Locate This, Not that: Class-Conditioned Sound Event DOA Estimation. Olga Slizovskaia, Gordon Wichern, Zhong-qiu Wang, Jonathan Le Roux. 711-715
- SALSA-Lite: A Fast and Effective Feature for Polyphonic Sound Event Localization and Detection with Microphone Arrays. Thi Ngoc Tho Nguyen, Douglas L. Jones, Karn N. Watcharasupat, Huy Phan, Woon-Seng Gan. 716-720
- SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization. Bing Yang, Hong Liu, Xiaofei Li. 721-725
- Closed-Form Single Source Direction-of-Arrival Estimator Using First-Order Relative Harmonic Coefficients. Yonggang Hu, Sharon Gannot. 726-730
- A Slide-Save Based Framework for Multi-Source DOA Extraction with Closely Spaced Sources. Jianhua Geng, Sifan Wang, Xin Lou. 731-735
- An End-to-End Deep Learning Framework For Multiple Audio Source Separation And Localization. Yu Chen, Bowen Liu, Zijian Zhang, Hun-Seok Kim. 736-740
- Deep Adaptation Control for Acoustic Echo Cancellation. Amir Ivry, Israel Cohen, Baruch Berdugo. 741-745
- Off-the-Shelf Deep Integration For Residual-Echo Suppression. Amir Ivry, Israel Cohen, Baruch Berdugo. 746-750
- A Complex Spectral Mapping with Inplace Convolution Recurrent Neural Networks For Acoustic Echo Cancellation. Chenggang Zhang, Jinjiang Liu, Xueliang Zhang. 751-755
- Deep Adaptive AEC: Hybrid of Deep Learning and Adaptive Acoustic Echo Cancellation. Hao Zhang, Srivatsan Kandadai, Harsha Rao, Minje Kim, Tarun Pruthi, Trausti Kristjansson. 756-760
- Computationally Efficient Fixed-Filter ANC for Speech Based on Long-Term Prediction for Headphone Applications. Yurii Iotov, Sidsel Marie Nørholm, Valiantsin Belyi, Mads Dyrholm, Mads Græsbøll Christensen. 761-765
- End-To-End Deep Learning-Based Adaptation Control for Frequency-Domain Adaptive System Identification. Thomas Haubner, Andreas Brendel, Walter Kellermann. 766-770
- A Few-Sample Strategy for Guitar Tablature Transcription Based on Inharmonicity Analysis and Playability Constraints. Grigoris Bastas, Stefanos Koutoupis, Maximos A. Kaliakatsos-Papakostas, Vassilis Katsouros, Petros Maragos. 771-775
- Exploring Transformer's Potential on Automatic Piano Transcription. Longshen Ou, Ziyi Guo, Emmanouil Benetos, Jiqing Han, Ye Wang. 776-780
- A Lightweight Instrument-Agnostic Model for Polyphonic Note Transcription and Multipitch Estimation. Rachel M. Bittner, Juan José Bosch, David Rubinstein, Gabriel Meseguer-Brocal, Sebastian Ewert. 781-785
- Towards Automatic Transcription of Polyphonic Electric Guitar Music: A New Dataset and a Multi-Loss Transformer Model. Yu-Hua Chen, Wen-Yi Hsiao, Tsu-Kuang Hsieh, Jyh-Shing Roger Jang, Yi-Hsuan Yang. 786-790
- Genre-Conditioned Acoustic Models for Automatic Lyrics Transcription of Polyphonic Music. Xiaoxue Gao, Chitralekha Gupta, Haizhou Li 0001. 791-795
- Pseudo-Label Transfer from Frame-Level to Note-Level in a Teacher-Student Framework for Singing Transcription from Polyphonic Music. Sangeun Kum, Jongpil Lee, Keunhyoung Luke Kim, Taehyoung Kim, Juhan Nam. 796-800
- Sound Event Detection Guided by Semantic Contexts of Scenes. Noriyuki Tonami, Keisuke Imoto, Ryotaro Nagase, Yuki Okamoto, Takahiro Fukumori, Yoichi Yamashita. 801-805
- CNN-Transformer with Self-Attention Network for Sound Event Detection. Keigo Wakayama, Shoichiro Saito. 806-810
- A Mutual Learning Framework for Few-Shot Sound Event Detection. Dongchao Yang, Helin Wang, Yuexian Zou, Zhongjie Ye, Wenwu Wang. 811-815
- Anomalous Sound Detection Using Spectral-Temporal Information Fusion. Youde Liu, Jian Guan, Qiaoxi Zhu, Wenwu Wang. 816-820
- Sparse Self-Attention for Semi-Supervised Sound Event Detection. Yadong Guan, Jiabin Xue, Guibin Zheng, Jiqing Han. 821-825
- Peer Collaborative Learning for Polyphonic Sound Event Detection. Hayato Endo, Hiromitsu Nishizaki. 826-830
- PostGAN: A GAN-Based Post-Processor to Enhance the Quality of Coded Speech. Srikanth Korse, Nicola Pia, Kishan Gupta, Guillaume Fuchs. 831-835
- A DNN Based Post-Filter to Enhance the Quality of Coded Speech in MDCT Domain. Kishan Gupta, Srikanth Korse, Bernd Edler, Guillaume Fuchs. 836-840
- A Two-Stage U-Net for High-Fidelity Denoising of Historical Recordings. Eloi Moliner, Vesa Välimäki. 841-845
- Experts Versus All-Rounders: Target Language Extraction for Multiple Target Languages. Marvin Borsdorf, Kevin Scheck, Haizhou Li 0001, Tanja Schultz. 846-850
- Category-Adapted Sound Event Enhancement with Weakly Labeled Data. Guangwei Li, Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu 0004. 851-855
- Sequential MCMC Methods for Audio Signal Enhancement. Rubén M. Clavería, Simon J. Godsill. 856-860
- Architecture for Variable Bitrate Neural Speech Codec with Configurable Computation Complexity. Tejas Jayashankar, Thilo Köhler, Kaustubh Kalgaonkar, Zhiping Xiu, Jilong Wu, Ju Lin, Prabhav Agrawal, Qing He. 861-865
- End-to-End Neural Speech Coding for Real-Time Communications. Xue Jiang, Xiulian Peng, Chengyu Zheng, Huaying Xue, Yuan Zhang 0013, Yan Lu. 866-870
- Deep Neural Network (DNN) Audio Coder Using A Perceptually Improved Training Method. Seungmin Shin, Joon Byun, Youngcheol Park, Jongmo Sung, Seungkwon Beack. 871-875
- Progressive Multi-Stage Neural Audio Coding with Guided References. Chanwoo Lee, Hyungseob Lim, Jihyun Lee, Inseon Jang, Hong-Goo Kang. 876-880
- Vocbench: A Neural Vocoder Benchmark for Speech SynthesisEhab A. AlBadawy, Andrew Gibiansky, Qing He, Jilong Wu, Ming-Ching Chang, Siwei Lyu. 881-885 [doi]
- Dnsmos P.835: A Non-Intrusive Perceptual Objective Speech Quality Metric to Evaluate Noise SuppressorsChandan K. A. Reddy, Vishak Gopal, Ross Cutler. 886-890 [doi]
- SQAPP: No-Reference Speech Quality Assessment Via Pairwise PreferencePranay Manocha, Zeyu Jin, Adam Finkelstein. 891-895 [doi]
- LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic SpeechWen-Chin Huang, Erica Cooper, Junichi Yamagishi, Tomoki Toda. 896-900 [doi]
- AECMOS: A Speech Quality Assessment Metric for Echo ImpairmentMarju Purin, Sten Sootla, Mateja Sponza, Ando Saabas, Ross Cutler. 901-905 [doi]
- MOS Predictor for Synthetic Speech with I-Vector InputsMiao Liu, Jing Wang, Shicong Li, Fei Xiang, Yue Yao, Lidong Yang. 906-910 [doi]
- Wave-Domain Approach for Cancelling Noise Entering Open WindowsDaan Ratering, W. Bastiaan Kleijn, Jean Gonzalez Silva, Riccardo M. G. Ferrari. 911-915 [doi]
- On Synchronization of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate Offsets and Speaker ChangesTobias Gburrek, Joerg Schmalenstroeer, Reinhold Haeb-Umbach. 916-920 [doi]
- Picknet: Real-Time Channel Selection for Ad Hoc Microphone ArraysTakuya Yoshioka, Xiaofei Wang, Dongmei Wang. 921-925 [doi]
- End-To-End Alexa Device ArbitrationJarred Barber, Yifeng Fan, Tao Zhang. 926-930 [doi]
- Instantaneous Linear Dimensionality Reduction of Multichannel Time-Series Signal for Array Signal ProcessingNatsuki Ueno, Nobutaka Ono. 931-935 [doi]
- Generalized Time Domain Velocity VectorSrdan Kitic, Jérôme Daniel. 936-940 [doi]
- Differentiable Digital Signal Processing Mixture Model for Synthesis Parameter Extraction from Mixture of Harmonic SoundsMasaya Kawamura, Tomohiko Nakamura, Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi, Kazunobu Kondo. 941-945 [doi]
- The Mirrornet : Learning Audio Synthesizer Controls Inspired by Sensorimotor InteractionYashish M. Siriwardena, Guilhem Marion, Shihab Shamma. 946-950 [doi]
- Deep Performer: Score-to-Audio Music Performance SynthesisHao-Wen Dong, Cong Zhou, Taylor Berg-Kirkpatrick, Julian J. McAuley. 951-955 [doi]
- KaraSinger: Score-Free Singing Voice Synthesis with VQ-VAE Using Mel-SpectrogramsChien-Feng Liao, Jen-Yu Liu, Yi-Hsuan Yang. 956-960 [doi]
- Adversarial Audio Synthesis Using a Harmonic-Percussive DiscriminatorJihyun Lee, Hyungseob Lim, Chanwoo Lee, Inseon Jang, Hong-Goo Kang. 961-965 [doi]
- SleepGAN: Towards Personalized Sleep Therapy MusicJing Yang, Chulhong Min, Akhil Mathur, Fahim Kawsar. 966-970 [doi]
- Diversity-Controllable and Accurate Audio Captioning Based on Neural ConditionXuenan Xu, Mengyue Wu, Kai Yu 0004. 971-975 [doi]
- AudioCLIP: Extending CLIP to Image, Text and AudioAndrey Guzhov, Federico Raue, Jörn Hees, Andreas Dengel 0001. 976-980 [doi]
- Can Audio Captions Be Evaluated With Image Caption Metrics?Zelin Zhou, Zhiling Zhang, Xuenan Xu, Zeyu Xie, Mengyue Wu, Kenny Q. Zhu. 981-985 [doi]
- A Data-Driven Cognitive Salience Model for Objective Perceptual Audio Quality AssessmentPablo M. Delgado, Jürgen Herre. 986-990 [doi]
- Improving Character Error Rate is Not Equal to Having Clean Speech: Speech Enhancement for ASR Systems with Black-Box Acoustic ModelsRyosuke Sawata, Yosuke Kashiwagi, Shusuke Takahashi. 991-995 [doi]
- Effect of Noise Suppression Losses on Speech Distortion and ASR PerformanceSebastian Braun, Hannes Gamper. 996-1000 [doi]
- Increasing Loudness in Audio Signals: A Perceptually Motivated Approach to Preserve Audio QualityA. Jeannerot, N. de Koeijer, P. Martínez-Nuevo, M. B. Møller, J. Dyreby, P. Prandoni. 1001-1005 [doi]
- Audio Peak Reduction Using a Synced Allpass FilterSebastian J. Schlecht, Leonardo Fierro, Vesa Välimäki, Juha Backman. 1006-1010 [doi]
- APPLADE: Adjustable Plug-and-Play Audio Declipper Combining DNN with Sparse OptimizationTomoro Tanaka, Kohei Yatabe, Masahiro Yasuda, Yasuhiro Oikawa. 1011-1015 [doi]
- Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, and Pretraining: An Ablation StudyDaniel Tompkins, Kshitiz Kumar, Jian Wu. 1016-1020 [doi]
- Threshold Independent Evaluation of Sound Event Detection ScoresJanek Ebbers, Reinhold Haeb-Umbach, Romain Serizel. 1021-1025 [doi]
- Multimodal Evaluation Method for Sound Event DetectionSeyed M. R. Modaresi, Aomar Osmani, Mohammadreza Razzazi, Abdelghani Chibani. 1026-1030 [doi]
- A Benchmark of State-of-the-Art Sound Event Detection Systems Evaluated on Synthetic SoundscapesFrancesca Ronchini, Romain Serizel. 1031-1035 [doi]
- Attentive Max Feature Map and Joint Training for Acoustic Scene ClassificationHye-jin Shim, Jee-weon Jung, Ju-ho Kim, Ha-Jin Yu. 1036-1040 [doi]
- ORCA-PARTY: An Automatic Killer Whale Sound Type Separation Toolkit Using Deep LearningChristian Bergler, Manuel Schmitt, Andreas K. Maier, Rachael Xi Cheng, Volker Barth, Elmar Nöth. 1046-1050 [doi]
- Sparsity-Based Sound Field Separation in the Spherical Harmonics DomainMirco Pezzoli, Maximo Cobos, Fabio Antonacci, Augusto Sarti. 1051-1055 [doi]
- Spatial Active Noise Control Based on Individual Kernel Interpolation of Primary and Secondary Sound FieldsKazuyuki Arikawa, Shoichi Koyama, Hiroshi Saruwatari. 1056-1060 [doi]
- Time-Domain Acoustic Contrast Control with A Spatial Uniformity Constraint for Personal Audio SystemsSipei Zhao, Ian S. Burnett. 1061-1065 [doi]
- Generation of Personal Sound Fields in Reverberant Environments Using Interframe CorrelationLiming Shi, Guoli Ping, Xiaoxiang Shen, Mads Græsbøll Christensen. 1066-1070 [doi]
- Variable Span Trade-Off Filter for Sound Zone Control with Kernel Interpolation WeightingJesper Brunnström, Shoichi Koyama, Marc Moonen. 1071-1075 [doi]
- Time Domain Radial Filter Design for Spherical WavesNara Hahn, Frank Schultz, Sascha Spors. 1076-1080 [doi]
- Feature Space Message Passing Network for Medical Image Semantic SegmentationJunxiao Sun, Ke Zhang, Shuyi Niu, Yan Zhang, Youyong Kong. 1081-1085 [doi]
- Cross-Domain Few-Shot Learning for Rare-Disease Skin Lesion SegmentationYixin Wang, Zhe Xu, Jiang Tian, Jie Luo, Zhongchao Shi, Yang Zhang, Jianping Fan 0007, Zhiqiang He. 1086-1090 [doi]
- Adaptive Pseudo Labeling for Source-Free Domain Adaptation in Medical Image SegmentationChen Li, Wei Chen, Xin Luo, Yulin He, Yusong Tan. 1091-1095 [doi]
- Object Detection and Tracking in Ultrasound Scans Using an Optical Flow and Semantic Segmentation Framework Based on Convolutional Neural NetworksAbdullah F. Al-Battal, Imanuel R. Lerman, Truong Q. Nguyen. 1096-1100 [doi]
- Heuristic Dropout: An Efficient Regularization Method for Medical Image Segmentation ModelsDachuan Shi, Ruiyang Liu, Linmi Tao, Chun Yuan. 1101-1105 [doi]
- Superresolution and Segmentation of OCT Scans Using Multi-Stage Adversarial Guided Attention TrainingParia Jeihouni, Omid Dehzangi, Annahita Amireskandari, Ali Dabouei, Ali Rezai, Nasser M. Nasrabadi. 1106-1110 [doi]
- Heart Rate and Oxygen Saturation Estimation from Facial Video with Multimodal Physiological Data GenerationYusuke Akamatsu, Yoshifumi Onishi, Hitoshi Imaoka. 1111-1115 [doi]
- EMGSE: Acoustic/EMG Fusion for Multimodal Speech EnhancementKuan-Chen Wang, Kai-Chun Liu, Hsin-Min Wang, Yu Tsao 0001. 1116-1120 [doi]
- A Dilated Residual Vision Transformer for Atrial Fibrillation Detection from Stacked Time-Frequency ECG RepresentationsSawon Pratiher, Apoorva Srivastava, Yedla Bindu Priyatha, Nirmalya Ghosh, Amit Patra. 1121-1125 [doi]
- Contrastive Heartbeats: Contrastive Learning for Self-Supervised ECG Representation and PhenotypingCrystal T. Wei, Ming-En Hsieh, Chien-Liang Liu, Vincent S. Tseng. 1126-1130 [doi]
- Ubiquitous Physiological Prediction of SUD Patients' Wellness State Using Memory-Based Convolutional ModelsOmid Dehzangi, Paria Jeihouni, Jad Ramadan, Victor S. Finomore, Nasser M. Nasrabadi, Ali Rezai. 1131-1135 [doi]
- Joint Hypoglycemia Prediction and Glucose Forecasting via Deep Multi-Task LearningMu Yang, Darpit Dave, Madhav Erraguntla, Gerard L. Coté, Ricardo Gutierrez-Osuna. 1136-1140 [doi]
- SegNet-Based Deep Representation Learning for Dysphagia ClassificationSiddharth Subramani, Achuth Rao M. V, Anwesha Roy, Prasanna Suresh Hegde, Prasanta Kumar Ghosh. 1141-1145 [doi]
- Robust Collaborative Learning for Sequence ModellingFrancois Buet-Golfouse, Hans Roggeman, Islam Utyagulov. 1146-1150 [doi]
- A Self-Supervised Pre-Training Framework for Vision-Based Seizure ClassificationJen-Cheng Hou, Aileen McGonigal, Fabrice Bartolomei, Monique Thonnat. 1151-1155 [doi]
- Design of Real-Time System Based on Machine Learning for Snoring and OSA DetectionHuaiwen Luo, Lu Zhang, Lianyu Zhou, Xu Lin, Zehuai Zhang, Mingjiang Wang. 1156-1160 [doi]
- Parametric Modeling of Human Wrist for Bioimpedance-Based Physiological SensingKaan Sel, Noah Huerta, Michael S. Sacks, Roozbeh Jafari. 1161-1165 [doi]
- Preliminary Results on the Generation of Artificial Handwriting Data Using a Decomposition-Recombination StrategyJosé Fernando Adrán Otero, Oscar Soláns Caballer, Pere Martí-Puig, Zhe Sun, Toshihisa Tanaka, Jordi Solé-Casals. 1166-1170 [doi]
- A Style Transfer Mapping and Fine-Tuning Subject Transfer Framework Using Convolutional Neural Networks for Surface Electromyogram Pattern RecognitionSuguru Kanoga, Takayuki Hoshino, Mitsunori Tada. 1171-1175 [doi]
- Feature-Based Sensing Matrix Design for Analog to Information ConvertersChencheng Guo, Hui Qian 0002, Baoling Hong. 1176-1180 [doi]
- ALSNet: A Dilated 1-D CNN for Identifying ALS from Raw EMG SignalK. M. Naimul Hassan, Md. Shamiul Alam Hridoy, Naima Tasnim, Atia Faria Chowdhury, Tanvir Alam Roni, Sheikh Tabrez, Arik Subhana, Celia Shahnaz. 1181-1185 [doi]
- Joint Model Order Estimation for Multiple Tensors with A Coupled Mode and Applications to the Joint Decomposition of EEG, MEG Magnetometer, and Gradiometer TensorsBilal Ahmad, Liana Khamidullina, Alexey Alexandrovich Korobkov, Alla Manina, Jens Haueisen, Martin Haardt. 1186-1190 [doi]
- An Experimental Study on Transferring Data-Driven Image Compressive Sensing to Bioelectric SignalsZhikang Zhang, Jonathan Zhao, Fengbo Ren. 1191-1195 [doi]
- Hand Gesture Recognition Using Temporal Convolutions and Attention MechanismElahe Rahimian, Soheil Zabihi, Amir Asif, Dario Farina, Seyed Farokh Atashzar, Arash Mohammadi 0001. 1196-1200 [doi]
- Combining Multiple Style Transfer Networks and Transfer Learning For LGE-CMR SegmentationBo Fang, Junxin Chen, Wei Wang, Yicong Zhou. 1201-1205 [doi]
- Multi-Domain Unpaired Ultrasound Image Artifact Removal Using a Single Convolutional Neural NetworkJaeyoung Huh, Shujaat Khan, Jong Chul Ye. 1206-1210 [doi]
- Improving Ultrasound Image Classification with Local Texture QuantisationXiao Li, Huizhi Liang, Sidhartha Nagala, Jane Chen. 1211-1215 [doi]
- Accelerated Intravascular Ultrasound Imaging using Deep Reinforcement LearningTristan S. W. Stevens, Nishith Chennakeshava, Frederik J. de Bruijn, Martin Pekar, Ruud J. G. van Sloun. 1216-1220 [doi]
- Deep Proximal Unfolding For Image Recovery from Under-Sampled Channel Data in Intravascular UltrasoundNishith Chennakeshava, Tristan S. W. Stevens, Frederik J. de Bruijn, Andrew Hancock, Martin Pekar, Yonina C. Eldar, Massimo Mischi, Ruud J. G. van Sloun. 1221-1225 [doi]
- Multiview Long-Short Spatial Contrastive Learning For 3D Medical Image AnalysisGongpeng Cao, Yiping Wang, Manli Zhang, Jing Zhang, Guixia Kang, Xin Xu. 1226-1230 [doi]
- Composing Graphical Models with Generative Adversarial Networks for EEG Signal ModelingKhuong Vo, Manoj Vishwanath, Ramesh Srinivasan, Nikil D. Dutt, Hung Cao. 1231-1235 [doi]
- Domain-Invariant Representation Learning from EEG with Private EncodersDavid Bethge, Philipp Hallgarten, Tobias Grosse-Puppendahl, Mohamed Kari, Ralf Mikut, Albrecht Schmidt 0001, Ozan Özdenizci. 1236-1240 [doi]
- Holistic Semi-Supervised Approaches for EEG Representation LearningGuangyi Zhang 0003, Ali Etemad. 1241-1245 [doi]
- Music Identification Using Brain Responses to Initial SnippetsPankaj Pandey, Gulshan Sharma, Krishna P. Miyapuram, Ramanathan Subramanian, Derek Lomas. 1246-1250 [doi]
- Multi-Level Spatial-Temporal Adaptation Network for Motor Imagery ClassificationWei Xu, Jing Wang, Ziyu Jia, Zhiqing Hong, Yunze Li, Youfang Lin. 1251-1255 [doi]
- Learning Subject-Invariant Representations from Speech-Evoked EEG Using Variational AutoencodersLies Bollens, Tom Francart, Hugo Van Hamme. 1256-1260 [doi]
- Unsupervised Hierarchical Translation-Based Model for Multi-Modal Medical Image RegistrationXinru Dai, Tai Ma, Haibin Cai, Ying Wen. 1261-1265 [doi]
- FAZ-BV: A Diabetic Macular Ischemia Grading Framework Combining FAZ Attention Network and Blood Vessel Enhancement FiltersZailiang Chen, Hailei Lan, Yongan Meng, Yuchen Xiong, Jing Luo, Hailan Shen. 1266-1270 [doi]
- Fracture Detection and Localization in Chest X-Rays Using Semi-Supervised Learning with Dynamic SharpeningLijuan Lu, Shun Miao, Ling Ye. 1271-1275 [doi]
- Histokt: Cross Knowledge Transfer in Computational PathologyRyan Zhang, Jiadai Zhu, Stephen Yang, Mahdi S. Hosseini, Angelo Genovese, Lina Chen, Corwyn Rowsell, Savvas Damaskinos, Sonal Varma, Konstantinos N. Plataniotis. 1276-1280 [doi]
- Unsupervised Deep Learning Network for Deformable Fundus Image RegistrationGiovana Augusta Benvenuto, Marilaine Colnago, Wallace Casaca. 1281-1285 [doi]
- A Minimally Supervised Approach for Medical Image Quality Assessment in Domain Shift SettingsHuijuan Yang, Aaron S. Coyner, Feri Guretno, Ivan Ho Mien, Chuan-Sheng Foo, J. Peter Campbell, Susan Ostmo, Michael F. Chiang, Pavitra Krishnaswamy. 1286-1290 [doi]
- A Channel Attention Based MLP-Mixer Network for Motor Imagery Decoding With EEGYanbin He, Zhiyang Lu, Jun Wang, Jun Shi 0004. 1291-1295 [doi]
- Towards Closed-Loop Speech Synthesis from Stereotactic EEG: A Unit Selection ApproachMiguel Angrick, Maarten C. Ottenhoff, Lorenz Diener, Darius Ivucic, Gabriel Ivucic, Sophocles Goulis, Albert J. Colon, Louis Wagner, Dean J. Krusienski, Pieter L. Kubben, Tanja Schultz, Christian Herff. 1296-1300 [doi]
- Enhancing Contextual Encoding With Stage-Confusion and Stage-Transition Estimation for EEG-Based Sleep StagingJauen Phyo, Wonjun Ko, Eunjin Jeon, Heung-Il Suk. 1301-1305 [doi]
- Improving BCI-based Color Vision Assessment Using Gaussian Process RegressionHadi Habibzadeh, Kevin J. Long, Ally E. Atkins, Daphney-Stavroula Zois, James J. S. Norton. 1306-1310 [doi]
- Transformer-Based Estimation of Spoken Sentences Using ElectrocorticographyShuji Komeiji, Kai Shigemi, Takumi Mitsuhashi, Yasushi Iimura, Hiroharu Suzuki, Hidenori Sugano, Koichi Shinoda, Toshihisa Tanaka. 1311-1315 [doi]
- Boost Ensemble Learning for Classification of CTG SignalsMarzieh Ajirak, Cassandra Heiselman, J. Gerald Quirk, Petar M. Djuric. 1316-1320 [doi]
- Multi-View Learning Based on Non-Redundant Fusion for ICU Patient Mortality PredictionYifan Wang, Ying Lan. 1321-1325 [doi]
- Improving Phase-Rectified Signal Averaging for Fetal Heart Rate AnalysisTong Chen, Guanchao Feng, Cassandra Heiselman, J. Gerald Quirk, Petar M. Djuric. 1326-1330 [doi]
- Unsupervised Clustering and Analysis of Contraction-Dependent Fetal Heart Rate SegmentsLiu Yang, Cassandra Heiselman, J. Gerald Quirk, Petar M. Djuric. 1331-1335 [doi]
- A Method for Detecting Coronary Artery Disease using Noisy Ultrashort Electrocardiogram RecordingsOrestis Apostolou, Vasileios S. Charisis, Georgios Apostolidis, Leontios J. Hadjileontiadis. 1336-1340 [doi]
- Multi-Task Gaussian Process Regression for the Detection of Sleep Cycles in Premature InfantsNele Sophie Brügge, Jan Graßhoff, Arne Weigenand, Philipp Rostalski. 1341-1345 [doi]
- Fast Low Rank Column-Wise Compressive Sensing For Accelerated Dynamic MRISilpa Babu, Seyedehsara Nayer, Sajan Goud Lingala, Namrata Vaswani. 1346-1350 [doi]
- MRI Recovery with a Self-Calibrated DenoiserSizhuo Liu, Philip Schniter, Rizwan Ahmad. 1351-1355 [doi]
- 3D Cross-Scale Feature Transformer Network for Brain MR Image Super-ResolutionWanqi Zhang, Lulu Wang, Wei Chen, Yuanyuan Jia, Zhongshi He, Jinglong Du. 1356-1360 [doi]
- Data Efficient Support Vector Machine Training Using the Minimum Description Length PrincipleHarsh Singh, Ognjen Arandjelovic. 1361-1365 [doi]
- Multiple Instance Learning with Task-Specific Multi-Level Features for Weakly Annotated Histopathological Image ClassificationYuanpin Zhou, Yao Lu. 1366-1370 [doi]
- Self-Knowledge Distillation based Self-Supervised Learning for COVID-19 Detection from Chest X-Ray ImagesGuang Li 0008, Ren Togo, Takahiro Ogawa 0001, Miki Haseyama. 1371-1375 [doi]
- Pixel-Level and Affinity-Level Knowledge Distillation for Unsupervised Segmentation of COVID-19 LesionsRui Xu 0002, Yufeng Wang, Xinchen Ye, Pengcheng Wu, Yen-Wei Chen 0001, Fangyi Xu, Wenchao Zhu, Chao Chen, Yong Zhou, Hongjie Hu, Xiaofeng Qu, Shoji Kido, Noriyuki Tomiyama. 1376-1380 [doi]
- Data Shapley Value for Handling Noisy Labels: An Application in Screening COVID-19 Pneumonia from Chest CT ScansNastaran Enshaei, Moezedin Javad Rafiee, Arash Mohammadi 0001, Farnoosh Naderkhani. 1381-1385 [doi]
- Accurate Multiscale Selective Fusion of CT and Video Images for Real-Time Endoscopic Camera 3D Tracking in Robotic SurgeryXiongbiao Luo. 1386-1390 [doi]
- Learning Deep Pathological Features for WSI-Level Cervical Cancer GradingRuixiang Geng, Qing Liu, Shuo Feng, Yixiong Liang. 1391-1395 [doi]
- Selective Scale Cascade Attention Network for Breast Cancer Histopathology Image ClassificationBowen Xu, Wenqiang Zhang. 1396-1400 [doi]
- Frequency-Specific Non-Linear Granger Causality in a Network of Brain SignalsArchishman Biswas, Hernando Ombao. 1401-1405 [doi]
- Epileptic Spike Detection by Recurrent Neural Networks with Self-Attention MechanismKosuke Fukumori, Noboru Yoshida, Hidenori Sugano, Madoka Nakajima, Toshihisa Tanaka. 1406-1410 [doi]
- Topological Correlation of Brain SignalsJian Yin 0020, Yuan Wang. 1411-1415 [doi]
- Online Detection of Scalp-Invisible Mesial-Temporal Brain Interictal Epileptiform Discharges from EEGBahman Abdi-Sargezeh, Antonio Valentín, Gonzalo Alarcón, Saeid Sanei. 1416-1420 [doi]
- Leveraging Sparse Coding for EEG Based Emotion Recognition in ShootingYulu Wang, Yiwen Sun, Lei Fang, Changshui Zhang. 1421-1425 [doi]
- A Novel Unsupervised Autoencoder-Based HFOs Detector in Intracranial EEG SignalsWeilai Li, Lanfeng Zhong, Weixi Xiang, Tongzhou Kang, Dakun Lai. 1426-1430 [doi]
- A Novel Convolutional Neural Network Based on Adaptive Multi-Scale Aggregation and Boundary-Aware for Lateral Ventricle Segmentation on MR ImagesFei Ye, Zhiqiang Wang, Sheng Zhu, Xuanya Li, Kai Hu 0002. 1431-1435 [doi]
- Multiscale Attention Aggregation Network for 2D Vessel SegmentationWenTao Liu, Huihua Yang, Tong Tian, Xipeng Pan, Weijin Xu. 1436-1440 [doi]
- TCRNet: Make Transformer, CNN and RNN Complement Each OtherXinxin Shan, Tai Ma, Anqi Gu, Haibin Cai, Ying Wen. 1441-1445 [doi]
- Double Noise Mean Teacher Self-Ensembling Model for Semi-Supervised Tumor SegmentationKe Zheng, Junhai Xu, Jianguo Wei. 1446-1450 [doi]
- Rethinking Computer-Aided Pelvis SegmentationSiming Yuan, Qing Liu, Shenghui Liao, Fuchang Han, Haitao Wei, Yingqi Zhang. 1451-1455 [doi]
- Vision Transformer-Based Retina Vessel Segmentation with Deep Adaptive Gamma CorrectionHyunwoo Yu, Jae-hun Shim, Jaeho Kwak, Jou Won Song, Suk-Ju Kang. 1456-1460 [doi]
- Spectral Permutation Test on Persistence DiagramsYuan Wang, Moo K. Chung, Julius Fridriksson. 1461-1465 [doi]
- Multi-Task fMRI Data Fusion Using IVA and PARAFAC2Isabell Lehmann, Evrim Acar, Tanuj Hasija, Mohammad A. B. S. Akhonda, Vince D. Calhoun, Peter J. Schreier, Tülay Adali. 1466-1470 [doi]
- Independent Vector Analysis Based Subgroup Identification from Multisubject fMRI DataH. Yang, Mohammad A. B. S. Akhonda, F. Ghayem, Qunfang Long, Vince D. Calhoun, Tülay Adali. 1471-1475 [doi]
- Improving Brain Decoding Methods and EvaluationDamian Pascual, Béni Egressy, Nicolas Affolter, Yiming Cai, Oliver Richter, Roger Wattenhofer. 1476-1480 [doi]
- Cmri2spec: Cine MRI Sequence to Spectrogram Synthesis via A Pairwise Heterogeneous TranslatorXiaofeng Liu 0001, Fangxu Xing, Maureen Stone, Jerry L. Prince, Jangwon Kim, Georges El Fakhri, Jonghye Woo. 1481-1485 [doi]
- Spatio-Temporal Attention Graph Convolution Network for Functional Connectome ClassificationWenhan Wang, Youyong Kong, Zhenghua Hou, Chunfeng Yang, Yonggui Yuan. 1486-1490 [doi]
- Bilevel Learning of ℓ1 Regularizers with Closed-Form Gradients (BLORC)Avrajit Ghosh, Michael T. McCann, Saiprasad Ravishankar. 1491-1495 [doi]
- Multiband Image Fusion with Controllable Error GuaranteesV. S. Unni, Ruturaj G. Gavaskar, Kunal N. Chaudhury. 1496-1500 [doi]
- Weighted Graph Embedded Low-Rank Projection Learning for Feature ExtractionZhuojie Huang, Shuping Zhao, Lunke Fei, Jigang Wu. 1501-1505 [doi]
- ADMM-DAD Net: A Deep Unfolding Network for Analysis Compressed SensingVasiliki Kouni, Georgios Paraskevopoulos, Holger Rauhut, George C. Alexandropoulos. 1506-1510 [doi]
- High-Dimensional Sparse Bayesian Learning without Covariance MatricesAlexander Lin, Andrew H. Song, Berkin Bilgic, Demba E. Ba. 1511-1515 [doi]
- A Trainable Bounded Denoiser Using Double Tight Frame Network for Snapshot Compressive ImagingBaoshun Shi, Yuxin Wang, Qiusheng Lian. 1516-1520 [doi]
- Progressive Image Super-Resolution via Neural Differential EquationSeobin Park, Tae-Hyun Kim. 1521-1525 [doi]
- High-Quality Self-Supervised Snapshot Hyperspectral ImagingYuhui Quan, Xinran Qin, Mingqin Chen, Yan Huang. 1526-1530 [doi]
- Robust Bayesian Reconstruction of Multispectral Single-Photon 3D Lidar Data with Non-Uniform BackgroundAbderrahim Halimi, Jakeoung Koo, Robert A. Lamb, Gerald S. Buller, Steve McLaughlin 0001. 1531-1535 [doi]
- Joint Calibration and Mapping of Satellite Altimetry Data Using Trainable Variational ModelsQuentin Febvre, Ronan Fablet, Julien Le Sommer, Clément Ubelmann. 1536-1540 [doi]
- 4D Convolutional Neural Networks for Multi-Spectral and Multi-Temporal Remote Sensing Data ClassificationMichalis Giannopoulos, Grigorios Tsagkatakis, Panagiotis Tsakalides. 1541-1545 [doi]
- A New Deep Learning Method for Multispectral Image Time Series Completion Using Hyperspectral DataC. T. Cissé, A. Alboody, M. Puigt, G. Roussel, V. Vantrepotte, C. Jamet, T. K. Tran. 1546-1550 [doi]
- Image Denoising with Deep Unfolding And Normalizing FlowsXinyi Wei, Hans Van Gorp, Lizeth Gonzalez-Carabarin, Daniel Freedman, Yonina C. Eldar, Ruud J. G. van Sloun. 1551-1555 [doi]
- 3D Texture Super Resolution via the Rendering LossRohit Ranade, Yangwen Liang, Shuangquan Wang, Dongwoon Bai, Jungwon Lee. 1556-1560 [doi]
- Bundle ICP with Virtual Depth for Hand-Held 3D ScannerChanghun Sung, Byungdeok Kim. 1561-1565 [doi]
- Sketched RT3D: How to Reconstruct Billions of Photons Per SecondJulián Tachella, Michael P. Sheehan, Mike E. Davies. 1566-1570 [doi]
- A Generic Method to Estimate Camera Extrinsic ParametersNaveen Kuruba, Neel Badadare, Vikram Narayan, Satish Putta. 1571-1575 [doi]
- Photon-Limited Deblurring Using Algorithm UnrollingYash Sanghvi, Abhiram Gnanasambandan, Stanley H. Chan. 1576-1580 [doi]
- +: Novel View Synthesis with Neural Regularisation Over Multi-Plane ImagesWenpeng Xing, Jie Chen. 1581-1585 [doi]
- Compressive Scanning Transmission Electron MicroscopyD. Nicholls, A. Robinson, J. Wells, A. Moshtaghpour, M. Bahri, A. Kirkland, N. Browning. 1586-1590 [doi]
- Deep Iterative Phase Retrieval for PtychographySimon Welker, Tal Peer, Henry N. Chapman, Timo Gerkmann. 1591-1595 [doi]
- Compressive Phase Retrieval Based On Sparse Latent Generative PriorsVinayak Killedar, Chandra Sekhar Seelamantula. 1596-1600 [doi]
- Model-Based Reconstruction for Collimated Beam Ultrasound SystemsAbdulrahman Alanazi, Singanallur V. Venkatakrishnan, Hector J. Santos-Villalobos, Gregery T. Buzzard, Charles A. Bouman. 1601-1605 [doi]
- Learned Acoustic Reconstruction Using Synthetic Aperture FocusingTim Straubinger, Robert Xiao, Helge Rhodin. 1606-1610 [doi]
- SDETR: Attention-Guided Salient Object Detection with TransformerGuanze Liu, Bo Xu, Han Huang, Cheng Lu 0006, Yandong Guo. 1611-1615 [doi]
- Evaluation of Video Coding for Machines without Ground TruthKristian Fischer, Markus Hofbauer, Christopher B. Kuhn, Eckehard G. Steinbach, André Kaup. 1616-1620 [doi]
- Raw Plenoptic Video Coding Under Hexagonal Lattice Resolution of Motion VectorsThuc Nguyen Huu, Vinh Van Duong, Jonghoon Yim, Byeungwoo Jeon. 1621-1624 [doi]
- Comparison of Boundary Artifact Removal Methods in Coding of Generalized Cubemap Projection Using VVCKianoush Jafari, Alireza Aminlou, Miska M. Hannuksela. 1625-1629 [doi]
- Low-Complexity Multi-Model CNN in-Loop Filter for AVS3Shen Wang, Yibing Fu, Chen Zhu, Li Song, Wenjun Zhang. 1630-1634 [doi]
- Unified Matrix Coding for NN Originated MIP in H.266/VVCJunyan Huo, Yu Sun, Haixin Wang, Shuai Wan, Fuzheng Yang, Ming Li. 1635-1639 [doi]
- FOV-Based Coding Optimization for 360-Degree Virtual Reality VideosYuanyuan Xu, Taoyu Yang, Zengjie Tan, Haolun Lan. 1640-1644 [doi]
- Multi-Hierarchy Proxy Structure for Deep Metric LearningJian Wang, Xinyue Li, Wei Song, Zhichao Zhang, Weiqi Guo. 1645-1649 [doi]
- Exploiting Caption Diversity for Unsupervised Video SummarizationMichail Kaseris, Ioannis Mademlis, Ioannis Pitas. 1650-1654 [doi]
- Clustering and Separating Similarities for Deep Unsupervised HashingWanqian Zhang, Dayan Wu, Chule Yang, Bo Li, Weiping Wang 0005. 1655-1659 [doi]
- Enhancing Prototypical Few-Shot Learning By Leveraging The Local-Level StrategyJunying Huang, Fan Chen, Keze Wang, Liang Lin, Dongyu Zhang. 1660-1664 [doi]
- Blind Unmixing Using A Double Deep Image PriorChao Zhou, Miguel R. D. Rodrigues. 1665-1669 [doi]
- A New Framework for Multiple Deep Correlation Filters Based Object TrackingYi Liu, Yanjie Liang, Qiangqiang Wu, Liming Zhang 0002, Hanzi Wang. 1670-1674 [doi]
- Adaptive Actor-Critic Bilateral FilterBo-Hao Chen, Hsiang-Yin Cheng, Jia-Li Yin. 1675-1679 [doi]
- Domain Decomposition Algorithms for Real-Time Homogeneous Diffusion Inpainting in 4KNiklas Kämper, Joachim Weickert. 1680-1684 [doi]
- Deep Temporal Interpolation of Radar-Based PrecipitationMichiaki Tatsubori, Takao Moriyama, Tatsuya Ishikawa, Paolo Fraccaro, Anne Jones, Blair Edwards, Julian Kuehnert, Sekou L. Remy. 1685-1689 [doi]
- A Nonlinear Steerable Complex Wavelet Decomposition of ImagesZikai Sun, Thierry Blu. 1690-1694 [doi]
- Kernel Estimation Network for Blind Super-ResolutionXiang Cao, Haibo Shen, Liangqi Zhang, Yihao Luo, Tianjiang Wang. 1695-1699 [doi]
- Terahertz Image Restoration Benchmarking DatasetYixiong Zhang, Zhipeng Su, Feng Qi, Jianyang Zhou, Xiaoping Zhang 0003. 1700-1704 [doi]
- Binary Dense Predictors for Human Pose Estimation Based on Dynamic Thresholds and FilteringXingrun Xing, Yalong Jiang, Baochang Zhang 0001, Wenrui Ding, Yangguang Li, Hongguang Li, Huan Peng. 1705-1709 [doi]
- Self-Supervised Learning for Sentiment Analysis via Image-Text MatchingHaidong Zhu, Zhaoheng Zheng, Mohammad Soleymani 0001, Ram Nevatia. 1710-1714 [doi]
- Domain-Agnostic Meta-Learning for Cross-Domain Few-Shot ClassificationWei-Yu Lee, Jheng-Yu Wang, Yu-Chiang Frank Wang. 1715-1719 [doi]
- Semantic Association Network for Video Corpus Moment RetrievalDahYun Kim, Sunjae Yoon, Ji Woo Hong, Chang D. Yoo. 1720-1724 [doi]
- Statistical, Spectral and Graph Representations for Video-Based Facial Expression Recognition in ChildrenNida Itrat Abbasi, Siyang Song, Hatice Gunes. 1725-1729 [doi]
- Deriving Explainable Discriminative Attributes Using Confusion About Counterfactual ClassNakyeong Yang, Taegwan Kang, Kyomin Jung. 1730-1734 [doi]
- Realistic Monocular-To-3D Virtual Try-On Via Multi-Scale Characteristics CaptureChenghu Du, Feng Yu 0017, Minghua Jiang, Yaxin Zhao, Xiong Wei, Tao Peng 0006, Xinrong Hu. 1735-1739 [doi]
- Optimizing Latent Space Directions for GAN-Based Local Image EditingEhsan Pajouheshgar, Tong Zhang, Sabine Süsstrunk. 1740-1744 [doi]
- Towards Using Clothes Style Transfer for Scenario-Aware Person Video GenerationJingning Xu, Benlai Tang, Mingjie Wang, Siyuan Bian, Wenyi Guo, Xiang Yin 0006, Zejun Ma. 1745-1749 [doi]
- Multi-Domain Unsupervised Image-to-Image Translation with Appearance Adaptive ConvolutionSomi Jeong, Jiyoung Lee, Kwanghoon Sohn. 1750-1754 [doi]
- VR-FAM: Variance-Reduced Encoder with Nonlinear Transformation for Facial Attribute ManipulationYifan Yuan, Siteng Ma, Junping Zhang. 1755-1759 [doi]
- Wavelet-Based Unsupervised Label-to-Image TranslationGeorge Eskandar, Mohamed Abdelsamad, Karim Armanious, Shuai Zhang, Bin Yang 0009. 1760-1764 [doi]
- Fast Graph Sampling for Short Video Summarization Using Gershgorin Disc AlignmentSadid Sahami, Gene Cheung, Chia-Wen Lin. 1765-1769 [doi]
- Towards Practical and Efficient Long Video SummaryXiaopeng Ke, Boyu Chang, Hao Wu, Fengyuan Xu, Sheng Zhong 0002. 1770-1774 [doi]
- Cut And Continuous Paste Towards Real-Time Deep Fall DetectionSunhee Hwang, Minsong Ki, Seung-Hyun Lee, Sanghoon Park, Byoung-Ki Jeon. 1775-1779 [doi]
- Mannet: A Large-Scale Manipulated Image Detection Dataset And Baseline EvaluationsAditya Singh, Saheb Chhabra, Puspita Majumdar, Richa Singh 0001, Mayank Vatsa. 1780-1784 [doi]
- Approaches Toward Physical and General Video Anomaly DetectionLaura Kart, Niv Cohen. 1785-1789 [doi]
- Considering User Agreement in Learning to Predict the Aesthetic QualitySuiyi Ling, Andreas Pastor, Junle Wang, Patrick Le Callet. 1790-1794 [doi]
- No-Reference Quality Assessment of Variable Frame-Rate Videos Using Temporal Bandpass StatisticsQi Zheng, Zhengzhong Tu, Yibo Fan, Xiaoyang Zeng, Alan C. Bovik. 1795-1799 [doi]
- Towards Joint Frame-Level and MOS Quality Predictions with Low-Complexity Objective ModelsJoel Jung, Alexandre Giraud, Meijia Song, Songnan Li, Xiang Li, Shan Liu. 1800-1804 [doi]
- Teaching CNNs to Mimic Human Visual Cognitive Process & Regularise Texture-Shape BiasSatyam Mohla, Anshul Nasery, Biplab Banerjee. 1805-1809 [doi]
- Subjective And Objective Quality Assessment Of Mobile Gaming VideoShaoguo Wen, Suiyi Ling, Junle Wang, Ximing Chen, Yanqing Jing, Patrick Le Callet. 1810-1814 [doi]
- ER-PIQA: A Task-Guided Pedestrian Image Quality Assessment Via Embedding ReconstructionYanzhe Zhong, Huadong Pan, Bangjie Tang, Zhonggeng Liu, Yiming Zhu, Jun Yin. 1815-1819 [doi]
- Multiscale Crowd Counting and Localization By Multitask Point SupervisionMohsen Zand, Haleh Damirchi, Andrew Farley, Mahdiyar Molahasani, Michael A. Greenspan, Ali Etemad. 1820-1824 [doi]
- Super-Resolution of Satellite Images by Two-Dimensional RRDB and Edge-Enhancement Generative Adversarial NetworkYu-Zhang Chen, Tsung-Jung Liu, Kuan-Hsien Liu. 1825-1829 [doi]
- Leveraging Local Temporal Information for Multimodal Scene ClassificationSaurabh Sahu, Palash Goyal. 1830-1834 [doi]
- Predicting Human Motion Using Key SubsequencesMenghao Li, Mingtao Pei, Wei Liang. 1835-1839 [doi]
- Dynamic Texture Recognition Using PDV Hashing and Dictionary Learning on Multi-Scale Volume Local Binary PatternRuxin Ding, Jianfeng Ren, Heng Yu, Jiawei Li 0001. 1840-1844 [doi]
- Do You Live a Healthy Life? Analyzing Lifestyle by Visual Life LoggingQing Gao, Mingtao Pei, Hongyu Shen. 1845-1849 [doi]
- Weighted Wavelet-Based Spectral-Spatial Transforms For CFA-Sampled Raw Camera Image Compression Considering Image FeaturesLiping Huang, Taizo Suzuki. 1850-1854 [doi]
- Jmpnet: Joint Motion Prediction for Learning-Based Video CompressionDongyang Li, Zhenhong Sun, Zhiyu Tan, Xiuyu Sun, Fangyi Zhang, Yichen Qian, Hao Li 0030. 1855-1859 [doi]
- A Low-Parametric Model for Bit-Rate Estimation of VVC Residual CodingFabian Brand, Christian Herglotz, André Kaup. 1860-1864 [doi]
- OPTE: Online Per-Title Encoding for Live Video StreamingVignesh V. Menon, Hadi Amirpour, Mohammed Ghanbari 0001, Christian Timmerer. 1865-1869 [doi]
- SADN: Learned Light Field Image Compression with Spatial-Angular DecorrelationKedeng Tong, Xin Jin, Chen Wang, Fan Jiang. 1870-1874 [doi]
- Hierarchical Feature Aggregation Network for Deep Image CompressionWenfeng Li, Zongcai Du, Hao He, Jie Tang 0006, Gangshan Wu. 1875-1879 [doi]
- Accurate Instance Segmentation Via Collaborative LearningTianyou Chen, Xiaoguang Hu, Jin Xiao, Guofeng Zhang 0002, Shaojie Wang. 1880-1884 [doi]
- Dynamic Binary Neural Network by Learning Channel-Wise ThresholdsJiehua Zhang, Zhuo Su 0002, Yanghe Feng, Xin Lu, Matti Pietikäinen, Li Liu 0002. 1885-1889 [doi]
- Self-Supervised Learning on A Lightweight Low-Light Image Enhancement Model with Curve RefinementWanyu Wu, Wei Wang, Kui Jiang, Xin Xu, Ruimin Hu. 1890-1894 [doi]
- Semantically Proportional Patchmix for Few-Shot LearningJingquan Wang, Jing Xu, Yu Pan 0005, Zenglin Xu. 1895-1899 [doi]
- Noise Suppression for Improved Few-Shot LearningZhikui Chen, Tiandong Ji, Suhua Zhang, Fangming Zhong. 1900-1904 [doi]
- Online Continual Learning Using Enhanced Random Vector Functional Link NetworksCheryl Sze Yin Wong, Guo Yang, ArulMurugan Ambikapathi, Savitha Ramasamy. 1905-1909 [doi]
- A Generalized Kernel Risk Sensitive Loss for Robust Two-Dimensional Singular Value DecompositionMiaohua Zhang, Yongsheng Gao 0001, Jun Zhou 0001. 1910-1914 [doi]
- Video Frame Interpolation via Local Lightweight Bidirectional Encoding with Channel Attention CascadeXiangling Ding, Pu Huang, Dengyong Zhang, Xianfeng Zhao. 1915-1919 [doi]
- Sain: Similarity-Aware Video Frame InterpolationYue Lv, Wenming Yang, Wangmeng Zuo, Qingmin Liao, Rui Zhu 0006. 1920-1924 [doi]
- Self-Learned Video Super-Resolution with Augmented Spatial and Temporal ContextZejia Fan, Jiaying Liu 0001, Wenhan Yang, Wei Xiang, Zongming Guo. 1925-1929 [doi]
- Deformable Convolution Dense Network for Compressed Video Quality EnhancementJiahui Liu, Mingcai Zhou, Meng Xiao. 1930-1934 [doi]
- Convolutional ISTA Network with Temporal Consistency Constraints for Video Reconstruction from Event CamerasSiying Liu, Roxana Alexandru, Pier Luigi Dragotti. 1935-1939 [doi]
- PMP-NET: Rethinking Visual Context for Scene Graph GenerationXuezhi Tong, Rui Wang 0032, Chuan Wang 0002, Sanyi Zhang, Xiaochun Cao. 1940-1944 [doi]
- Improve Image Captioning Via Relation ModelingFeicheng Huang, Zhixin Li. 1945-1949 [doi]
- Equal Loss: A Simple Loss Function for Noise Robust LearningLei Cui, Huan Peng, Yangguang Li, Chuming Li, Xingrun Xing. 1950-1954 [doi]
- Informative Attention Supervision for Grounded Video DescriptionBoyang Wan, Wenhui Jiang, Yuming Fang. 1955-1959 [doi]
- Spatial-Context-Aware Deep Neural Network for Multi-Class Image ClassificationJialu Zhang, Qian Zhang, Jianfeng Ren, Yitian Zhao, Jiang Liu. 1960-1964 [doi]
- Transtl: Spatial-Temporal Localization Transformer for Multi-Label Video ClassificationHongjun Wu, Mengzhu Li, Yongcheng Liu, Hongzhe Liu, Cheng Xu, Xuewei Li. 1965-1969 [doi]
- Deep Video Inpainting Guided by Audio-Visual Self-SupervisionKyuyeon Kim, Junsik Jung, Woo-Jae Kim, Sung-Eui Yoon. 1970-1974 [doi]
- Navigating Audio-Visual Event Detection Across Mismatched ModalitiesGuangwei Li, Xuenan Xu, Mengyue Wu, Kai Yu 0004. 1975-1979 [doi]
- Look, Listen and Pay More Attention: Fusing Multi-Modal Information for Video Violence DetectionDong-Lai Wei, Chen-Geng Liu, Yang Liu, Jing Liu, Xiao-guang Zhu, Xin-Hua Zeng. 1980-1984 [doi]
- Multi-Modal Learning with Text Merging for TEXTVQAChangsheng Xu, Zhenlong Xu, Yifan He, Shuigeng Zhou, Jihong Guan. 1985-1989 [doi]
- A Novel Part Feature Integration and Fusion Method for Fine-Grained Vehicle RecognitionPing Wang, Yijie Cao, Lei Lu. 1990-1994 [doi]
- Monocular Vehicle 3D Bounding Box Estimation Using Homograhy and Geometry in Traffic SceneYiqiang Chen, Feng Liu, Ke Pei. 1995-1999 [doi]
- FSM: Feature Sampling Module for Object DetectionXin Yi, Bo Ma, Jiahao Wu. 2000-2004 [doi]
- Rethinking Two-B-Real Net for Real-Time Salient Object DetectionSenyun Kuang, Shijin Meng, Bo Xiao, Lv Tang, Bo Li 0115. 2005-2009 [doi]
- Balanced Ranking and Sorting For Class Incremental Object DetectionBo Cui, Hui Qu, Xuhui Huang, Shan Yu. 2010-2014 [doi]
- Multi-Scale Reinforcement Learning Strategy for Object DetectionYihao Luo, Xiang Cao, Juntao Zhang, Leixilan Pan, Tianjiang Wang, Qi Feng 0003. 2015-2019 [doi]
- Deep Object Detection with Example Attribute Based Prediction ModulationZhihao Wu, Chengliang Liu, Chao Huang 0001, Jie Wen 0001, Yong Xu 0001. 2020-2024 [doi]
- Universal Efficient Variable-Rate Neural Image CompressionShanzhi Yin, Chao Li, Youneng Bao, Yongsheng Liang, Fanyang Meng, Wei Liu. 2025-2029 [doi]
- AdderIC: Towards Low Computation Cost Image CompressionBowen Li, Xin Yao 0001, Chao Li, Youneng Bao, Fanyang Meng, Yongsheng Liang. 2030-2034 [doi]
- DCNGAN: A Deformable Convolution-Based GAN with QP Adaptation for Perceptual Quality Enhancement of Compressed VideoSaiping Zhang, Luis Herranz, Marta Mrak, Marc Górriz Blanch, Shuai Wan, Fuzheng Yang. 2035-2039 [doi]
- Specialised Video Quality Model For Enhanced User Generated Content (UGC) With Special EffectsAnne-Flore Perrin, Yejing Xie, Tao Zhang, Yiting Liao, Junlin Li, Patrick Le Callet. 2040-2044 [doi]
- Improving Maximum Likelihood Difference Scaling Method To Measure Inter Content ScaleAndreas Pastor, Lukás Krasula, Xiaoqing Zhu, Zhi Li 0001, Patrick Le Callet. 2045-2049 [doi]
- Texture Information Boosts Video Quality AssessmentAo-Xiang Zhang, Yuan-Gen Wang. 2050-2054 [doi]
- Plug-and-Play and Relay Regularizations on Noisy Low Rank Tensor Completion for Snapshot Multispectral Image RestorationKeisuke Ozawa. 2055-2059 [doi]
- LERPS: Lighting Estimation and Relighting for Photometric StereoAshish Tiwari, Shanmuganathan Raman. 2060-2064 [doi]
- A Unified Two-Stage Model for Separating Superimposed ImagesHuiyu Duan, Xiongkuo Min, Wei Shen, Guangtao Zhai. 2065-2069 [doi]
- Parameter-Free Style Projection for Arbitrary Image Style TransferSiyu Huang, Haoyi Xiong, Tianyang Wang, Bihan Wen, Qingzhong Wang, Zeyu Chen, Jun Huan, Dejing Dou. 2070-2074 [doi]
- Optimization of Compressive Light Field Display in Dual-Guided LearningYangfan Sun, Zhu Li, Li Li, Shizheng Wang, Wei Gao 0003. 2075-2079 [doi]
- ARM 4-BIT PQ: SIMD-Based Acceleration for Approximate Nearest Neighbor Search on ARMYusuke Matsui, Yoshiki Imaizumi, Naoya Miyamoto, Naoki Yoshifuji. 2080-2084 [doi]
- Iterative Learning for Distorted Image RestorationChao Wang, Yi Gu, Jie Li, Xinlei He, Zirui Zhang, Yuting Gao, Chentao Wu. 2085-2089 [doi]
- 2NET: Joint Exploitation and Exploration in Reinforcement Learning Based Image RestorationXiaoyu Zhang, Wei Gao 0003, Hui Yuan, Ge Li. 2090-2094 [doi]
- Multiple Patch-Aware Network for Faster Real-World Image DehazingKun Yang, Juan Zhang, Xiaoqi Lang. 2095-2099 [doi]
- Learning to Fuse Heterogeneous Features for Low-Light Image EnhancementZhenyu Tang, Long Ma 0002, Xiaoke Shang, Xin Fan 0001. 2100-2104 [doi]
- Deep Scale-Aware Image SmoothingJiachun Li, Kunkun Qin, Ruotao Xu, Hui Ji. 2105-2109 [doi]
- A Multiscale Gradient-Backpropagation Optimization Framework for Deformable Convolution Based Compressed Video EnhancementYanbo Gao, Menghu Jia, Shuai Li 0005, Xun Cai, Mao Ye, Frédéric Dufaux. 2110-2114 [doi]
- Downstream Augmentation Generation For Contrastive LearningTomohiro Hayase, Suguru Yasutomi, Nakamasa Inoue. 2115-2119 [doi]
- Few-Shot Learning with Improved Local Representations via Bias Rectify ModuleChao Dong, Qi Ye, Wenchao Meng, Kaixiang Yang. 2120-2124 [doi]
- Image-to-Video Re-Identification via Mutual Discriminative Knowledge TransferPichao Wang, Fan Wang, Hao Li 0030. 2125-2129 [doi]
- DynSNN: A Dynamic Approach to Reduce Redundancy in Spiking Neural NetworksFangxin Liu, Wenbo Zhao, Yongbiao Chen, Zongwu Wang, Fei Dai. 2130-2134 [doi]
- MEJIGCLU: More Effective Jigsaw Clustering For Unsupervised Visual Representation LearningYongsheng Zhang, Qing Liu 0003, Yang Zhao, Yixiong Liang. 2135-2139 [doi]
- Ganet: Unary Attention Reaches Pairwise Attention Via Implicit Group Clustering in Light-Weight CNNsCheng Zhuang, Yunlian Sun. 2140-2144 [doi]
- Find The Way Back: Invertible Kernel Estimator For Blind Image Super-ResolutionTing-Wei Chang, Wei-chen Chiu, Ching-Chun Huang. 2145-2149 [doi]
- Fine-Grained Dynamic Loss for Accurate Single-Image Super-ResolutionHaoquan Wang, Gang Zhang, Zhichun Lei. 2150-2154 [doi]
- Multi-Frame Super-Resolution With Raw Images Via Modified Deformable ConvolutionGongzhe Li, Linwei Qiu, Haopeng Zhang 0001, Fengying Xie, Zhiguo Jiang. 2155-2159 [doi]
- Local-Global Feature Aggregation for Light Field Image Super-ResolutionYan Wang, Yao Lu, Shunzhou Wang, Wenyao Zhang, Zijian Wang. 2160-2164 [doi]
- Pyramid Fusion Attention Network For Single Image Super-ResolutionHao He, Zongcai Du, Wenfeng Li, Jie Tang 0006, Gangshan Wu. 2165-2169 [doi]
- VCD: View-Constraint Disentanglement for Action RecognitionXian Zhong, Zhuo Zhou, Wenxuan Liu, Kui Jiang, Xuemei Jia, Wenxin Huang, Zheng Wang 0007. 2170-2174 [doi]
- Privacy-Preserving Action RecognitionChengming Zou, Ducheng Yuan, Long Lan, Haoang Chi. 2175-2179 [doi]
- Spatio-Temporal Motion Aggregation Network for Video Action DetectionHongcheng Zhang, Xu Zhao. 2180-2184 [doi]
- TP-VIT: A Two-Pathway Vision Transformer for Video Action RecognitionYanhao Jing, Feng Wang. 2185-2189 [doi]
- Learning Task-Specific Representation for Video Anomaly Detection with Spatial-Temporal AttentionYang Liu, Jing Liu, Xiaoguang Zhu, Donglai Wei, Xiaohong Huang, Liang Song. 2190-2194 [doi]
- W-ART: Action Relation Transformer for Weakly-Supervised Temporal Action LocalizationMengzhu Li, Hongjun Wu, Yongcheng Liu, Hongzhe Liu, Cheng Xu, Xuewei Li. 2195-2199 [doi]
- MS-ROCANet: Multi-Scale Residual Orthogonal-Channel Attention Network for Scene Text DetectionJinpeng Liu, Song Wu, Dehong He, Guoqiang Xiao. 2200-2204 [doi]
- Bi-Directional Normalization and Color Attention-Guided Generative Adversarial Network for Image EnhancementShan Liu, Guoqiang Xiao, Xiaohui Xu, Song Wu. 2205-2209 [doi]
- Dual-Attention Network for Few-Shot SegmentationZhikui Chen, Han Wang, Suhua Zhang, Fangming Zhong. 2210-2214 [doi]
- Attention Guided Invariance Selection for Local Feature DescriptorsJiapeng Li, Ge Li, Thomas H. Li. 2215-2219 [doi]
- Attention Probe: Vision Transformer Distillation in the WildJiahao Wang, Mingdeng Cao, Shuwei Shi, Baoyuan Wu, Yujiu Yang. 2220-2224 [doi]
- Stacked Multi-Scale Attention Network for Image ColorizationBin Jiang 0006, Fangqiang Xu, Jun Xia, Chao Yang 0015, Wei Huang, Yun Huang. 2225-2229 [doi]
- CRPN: Distinguish Novel Categories Via Class-Relevant Region Proposal Network for Few-Shot Object DetectionHan Wang, Yali Li, Shengjin Wang. 2230-2234 [doi]
- An Efficient Framework for Detection and Recognition of Numerical Traffic SignsZhishan Li, Mingmu Chen, Yifan He, Lei Xie 0007, Hongye Su. 2235-2239 [doi]
- Divergence-Guided Feature Alignment for Cross-Domain Object DetectionZongyao Li, Ren Togo, Takahiro Ogawa 0001, Miki Haseyama. 2240-2244 [doi]
- PGTRNET: Two-Phase Weakly Supervised Object Detection with Pseudo Ground Truth RefinementJun Wang, Hefeng Zhou, Xiaohan Yu. 2245-2249 [doi]
- Novel Instance Mining with Pseudo-Margin Evaluation for Few-Shot Object DetectionWeijie Liu, Chong Wang, Shenghao Yu, Chenchen Tao, Jun Wang, Jiafei Wu. 2250-2254 [doi]
- BiP-Net: Bidirectional Perspective Strategy Based Arbitrary-Shaped Text Detection NetworkChuang Yang, Mulin Chen, Yuan Yuan, Qi Wang 0009. 2255-2259 [doi]
- A Novel Lightweight Network for Fast Monocular Depth EstimationTim Heydrich, Yimin Yang, Xiangyu Ma, Yu Liu, Shan Du. 2260-2264 [doi]
- A Lightweight Self-Supervised Training Framework for Monocular Depth EstimationTim Heydrich, Yimin Yang, Shan Du. 2265-2269 [doi]
- PU-Refiner: A Geometry Refiner with Adversarial Learning for Point Cloud UpsamplingHao Liu 0044, Hui Yuan 0001, Raouf Hamzaoui, Wei Gao 0003, Shuai Li. 2270-2274 [doi]
- CF-Net: Complementary Fusion Network for Rotation Invariant Point Cloud CompletionBo-Fan Chen, Yang-Ming Yeh, Yi-Chang Lu. 2275-2279 [doi]
- TH-Net: A Method Of Single 3d Object Tracking Based On Transformers And Hausdorff DistanceZihao Zhang, Nan Sang, Xupeng Wang. 2280-2284 [doi]
- Enrich Features for Few-Shot Point Cloud ClassificationHengxin Feng, Weifeng Liu, Yanjiang Wang 0001, Baodi Liu. 2285-2289 [doi]
- Semi-Supervised 360° Depth Estimation from Multiple Fisheye Cameras with Pixel-Level Selective LossJaewoo Lee, Daeul Park, Dongwook Lee, Daehyun Ji. 2290-2294 [doi]
- Underwater Stereo Matching Via Unsupervised Appearance And Feature Adaptation NetworksWei Zhong, Yazhi Yuan, Xinchen Ye, Dian Zheng, Rui Xu 0002. 2295-2299 [doi]
- Domain Adaptation via Mutual Information Maximization for Handwriting RecognitionPei Tang, Liangrui Peng, Ruijie Yan, Haodong Shi, Gang Yao, Changsong Liu, Jie Li, Yuqi Zhang. 2300-2304 [doi]
- Attribute-Conditioned Face Swapping Network for Low-Resolution ImagesAng Li, Jian Hu, Chilin Fu, Xiaolu Zhang, Jun Zhou. 2305-2309 [doi]
- Learning Multiple Explainable and Generalizable Cues for Face Anti-SpoofingYing Bian, Peng Zhang, Jingjing Wang, Chunmao Wang, Shiliang Pu. 2310-2314 [doi]
- Off-The-Grid Covariance-Based Super-Resolution Fluctuation MicroscopyBastien Laville, Laure Blanc-Féraud, Gilles Aubert. 2315-2319 [doi]
- Simultaneous Nonlocal Low-Rank And Deep Priors For Poisson DenoisingZhiyuan Zha, Bihan Wen, Xin Yuan 0002, Jiantao Zhou 0001, Ce Zhu. 2320-2324 [doi]
- Double Closed-Loop Network for Image DeblurringYiming Liu, Yanni Zhang, Qiang Li, Jun Kong, Miao Qi, Jianzhong Wang. 2325-2329 [doi]
- Single Image De-Raining with High-Low Frequency GuidanceYing Zhang, Youjun Xiang, Lei Cai, Yuli Fu 0001, Wanliang Huo, Junjun Xia. 2330-2334 [doi]
- Detail Generation and Fusion Networks for Image InpaintingWu Yang, Wuzhen Shi. 2335-2339 [doi]
- Adaptive Weighted Network With Edge Enhancement Module For Monocular Self-Supervised Depth EstimationHong Liu 0008, Ying Zhu, Guoliang Hua, Weibo Huang, Runwei Ding. 2340-2344 [doi]
- Pas-Mef: Multi-Exposure Image Fusion Based On Principal Component Analysis, Adaptive Well-Exposedness And Saliency MapDiclehan Karakaya, Oguzhan Ulucan, Mehmet Türkan. 2345-2349 [doi]
- PDD-Net: A Precise Defect Detection Network Based on Point Set RepresentationMiaoju Ban, Runwei Ding, Jian Zhang, Tianyu Guo 0001, Tao Wang. 2350-2354 [doi]
- Solving The Long-Tailed Problem Via Intra- And Inter-Category BalanceRenhui Zhang, Tiancheng Lin 0001, Rui Zhang, Yi Xu. 2355-2359 [doi]
- Extracting and Distilling Direction-Adaptive Knowledge for Lightweight Object Detection in Remote Sensing ImagesZhanchao Huang, Wei Li 0032, Ran Tao 0003. 2360-2364 [doi]
- Pseudo-Interacting Guided Network for Few-Shot SegmentationXiaoliu Luo, Jing Luo, Zhao Duan, Jin Tan, Taiping Zhang. 2365-2369 [doi]
- Few-Shot Generation By Modeling Stereoscopic PriorsYuehui Wang, Qing Wang, Dongyu Zhang. 2370-2374 [doi]
- Relative Viewpoint Estimation Based on Structured 3d Representation AlignmentKohei Matsuzaki, Kei Kawamura. 2375-2379 [doi]
- Deep Markov Clustering for Panoptic SegmentationMinxiang Ye, Yifei Zhang, Shiqiang Zhu, Anhuan Xie, Dan Zhang. 2380-2384 [doi]
- Multi-Task Learning Improves the Brain Stoke Lesion SegmentationLibo Liu, Chengjian Huang, Chunsheng Cai, Xiaodong Zhang, Qingmao Hu. 2385-2389 [doi]
- Mixed Transformer U-Net for Medical Image SegmentationHongyi Wang, Shiao Xie, Lanfen Lin, Yutaro Iwamoto, Xian-Hua Han, Yen-Wei Chen 0001, Ruofeng Tong 0001. 2390-2394 [doi]
- Contrastive Translation Learning For Medical Image SegmentationWankang Zeng, Wenkang Fan, Dongfang Shen, Yinran Chen, Xiongbiao Luo. 2395-2399 [doi]
- Fast Video Object Segmentation via Dynamic YOLACTTianfang Meng, Wenqiang Zhang. 2400-2404 [doi]
- Depth Removal Distillation for RGB-D Semantic SegmentationTiyu Fang, Zhen Liang, Xiuli Shao, Zihao Dong, Jinping Li. 2405-2409 [doi]
- Mask-Based Attention Parallel Network for in-the-Wild Facial Expression RecognitionLingzhao Ju, Xu Zhao. 2410-2414 [doi]
- SDNET: Lightweight Facial Expression Recognition For Sample DisequilibriumLifang Zhou, Siqin Li, Yi Wang, Junlin Liu. 2415-2419 [doi]
- A Novel Micro-Expression Recognition Approach Using Attention-Based Magnification-Adaptive NetworksMengting Wei, Wenming Zheng, Yuan Zong, Xingxun Jiang, Cheng Lu 0005, Jiateng Liu. 2420-2424 [doi]
- Lipreading Model Based On Whole-Part Collaborative LearningWeidong Tian, Housen Zhang, Chen Peng, Zhong-Qiu Zhao. 2425-2429 [doi]
- What Is The Patient Looking At? Robust Gaze-Scene Intersection Under Free-Viewing ConditionsAhmed Al-Hindawi, Marcela P. Vizcaychipi, Yiannis Demiris. 2430-2434 [doi]
- GAZEATTENTIONNET: Gaze Estimation with AttentionsHaoxian Huang, Luqian Ren, Zhuo Yang, Yinwei Zhan, Qieshi Zhang, Jujian Lv. 2435-2439 [doi]
- Low-Light Image Enhancement via Feature RestorationYang Yang, Yonghua Zhang, Xiaojie Guo. 2440-2444 [doi]
- HIRL: Hybrid Image Restoration Based on Hierarchical Deep Reinforcement Learning via Two-Step AnalysisXiaoyu Zhang, Wei Gao 0003. 2445-2449 [doi]
- High-Fidelity Portrait Editing Via Exploring Differentiable Guided Sketches from the Latent SpaceChengrong Wang, Chenjie Cao, Yanwei Fu, Xiangyang Xue. 2450-2454 [doi]
- Learning Adjustable Image Rescaling with Joint Optimization of Perception and DistortionZhihong Pan 0001. 2455-2459 [doi]
- FSOINET: Feature-Space Optimization-Inspired Network For Image Compressive SensingWenjun Chen, Chunling Yang, Xin Yang. 2460-2464 [doi]
- Disentangled Feature-Guided Multi-Exposure High Dynamic Range ImagingKeuntek Lee, Yeong Il Jang, Nam Ik Cho. 2465-2469 [doi]
- Defending Against Universal Attack Via Curvature-Aware Category Adversarial TrainingPeilun Du, Xiaolong Zheng, Liang Liu 0001, Huadong Ma. 2470-2474 [doi]
- SP Attack: Single-Perspective Attack for Generating Adversarial Omnidirectional ImagesYunjian Zhang, Yanwei Liu, Jinxia Liu, Pengwei Zhan, Liming Wang, Zhen Xu. 2475-2479 [doi]
- Few-Shot One-Class Domain Adaptation Based On Frequency For Iris Presentation Attack DetectionYachun Li, Ying Lian, Jingjing Wang, Yuhui Chen, Chunmao Wang, Shiliang Pu. 2480-2484 [doi]
- Pixinwav: Residual Steganography for Hiding Pixels in AudioMargarita Geleta, Cristina Punti, Kevin McGuinness, Jordi Pons, Cristian Canton, Xavier Giró i Nieto. 2485-2489 [doi]
- A Semi-Handcrafted Keypoint Detector with Discriminative Feature EncodingYurui Xie, Ling Guan. 2490-2494 [doi]
- Safari from Visual Signals: Recovering Volumetric 3d ShapesAntonio Agudo. 2495-2499 [doi]
- Coupled Feature Learning Via Structured Convolutional Sparse Coding for Multimodal Image FusionFarshad G. Veshki, Sergiy A. Vorobyov. 2500-2504 [doi]
- DOMAINDESC: Learning Local Descriptors With Domain AdaptationRongtao Xu, Changwei Wang, Bin Fan, Yuyang Zhang, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang 0001. 2505-2509 [doi]
- Multi-Head Relu Implicit Neural Representation NetworksArya Aftab, Alireza Morsali, Shahrokh Ghaemmaghami. 2510-2514 [doi]
- An Efficient Method for Model Pruning Using Knowledge Distillation with Few SamplesZhaoJing Zhou, Yun Zhou, Zhuqing Jiang, Aidong Men, Haiying Wang 0005. 2515-2519 [doi]
- Adaptive Intra-Group Aggregation for Co-Saliency DetectionGuangyu Ren, Tianhong Dai, Tania Stathaki. 2520-2524 [doi]
- Novel Class Discovery: A Dependency ApproachTanmoy Mukherjee, Nikos Deligiannis. 2525-2528 [doi]
- Single-Shot Balanced Detector for Geospatial Object DetectionYanfeng Liu, Qiang Li, Yuan Yuan, Qi Wang. 2529-2533 [doi]
- Regularized Latent Space Exploration for Discriminative Face Super-ResolutionRuixin Shi, Junzheng Zhang, Yong Li, Shiming Ge. 2534-2538 [doi]
- Enhancing and Dissecting Crowd Counting by Synthetic DataYi Hou, Chengyang Li, Yuheng Lu, Liping Zhu, Yuan Li, Huizhu Jia, Xiaodong Xie. 2539-2543 [doi]
- Multi-Pose Virtual Try-On Via Self-Adaptive Feature FilteringChenghu Du, Feng Yu 0017, Minghua Jiang, Xiong Wei, Tao Peng 0006, Xinrong Hu. 2544-2548 [doi]
- Histogram-Guided Semantic-Aware ColorizationJie Zhang, Yi Xiao, Guo Chen, Qingping Sun, Fangqiang Xu, Chi-Sing Leung. 2549-2553 [doi]
- Content Preserving Scale Space Network for Fast Image Restoration from Noisy-Blurry PairsGreen Rosh K. S, Nikhil Krishnan, B. H. Pawan Prasad, Sachin Deepak Lomte. 2554-2558 [doi]
- Flow-Based Point Cloud Completion Network with Adversarial RefinementRong Bao, Yurui Ren, Ge Li 0002, Wei Gao 0003, Shan Liu 0001. 2559-2563 [doi]
- Weakly Supervised Point Cloud Upsampling VIA Optimal TransportZezeng Li, Weimin Wang, Na Lei, Rui Wang. 2564-2568 [doi]
- Point Cloud Denoising Using Normal Vector-Based Graph Wavelet ShrinkageRyosuke Watanabe, Keisuke Nonaka, Haruhisa Kato, Eduardo Pavez, Tatsuya Kobayashi, Antonio Ortega. 2569-2573 [doi]
- Dynamic Point Cloud InterpolationAnique Akhtar, Zhu Li 0001, Geert Van Der Auwera, Jianle Chen. 2574-2578 [doi]
- Point Cloud Attribute Compression Via Chroma SubsamplingShashank N. Sridhara, Eduardo Pavez, Antonio Ortega, Ryosuke Watanabe, Keisuke Nonaka. 2579-2583 [doi]
- Rangeinet: Fast Lidar Point Cloud Temporal InterpolationLili Zhao, Xuhu Lin, Wenyi Wang, Kai-Kuang Ma, Jianwen Chen. 2584-2588 [doi]
- MBNet: A Multi-Resolution Branch Network for Semantic Segmentation Of Ultra-High Resolution ImagesLianlei Shan, Weiqiang Wang. 2589-2593 [doi]
- BSOLO: Boundary-Aware One-Stage Instance Segmentation SOLOYuxuan Zhang, Wei Yang. 2594-2598 [doi]
- CS-GResNet: A Simple and Highly Efficient Network for Facial Expression RecognitionShaoping Jiang, Xiangmin Xu, Fang Liu, Xiaofen Xing, Lin Wang. 2599-2603 [doi]
- RCANet: Row-Column Attention Network for Semantic SegmentationBingxu Lu, Qinghua Hu, Yu Wang, Guosheng Hu. 2604-2608 [doi]
- Exploring Category Consistency for Weakly Supervised Semantic SegmentationZhaozhi Xie, Hongtao Lu. 2609-2613 [doi]
- Vision Transformer Equipped With Neural Resizer On Facial Expression Recognition TaskHyeonbin Hwang, Soyeon Kim, Wei-Jin Park, Jiho Seo, Kyungtae Ko, Hyeon Yeo. 2614-2618 [doi]
- ISDA: Position-Aware Instance Segmentation with Deformable AttentionKaining Ying, Zhenhua Wang, Cong Bai, Pengfei Zhou. 2619-2623 [doi]
- Improving Class Activation Map for Weakly Supervised Object LocalizationZhenfei Zhang, Ming-Ching Chang, Tien D. Bui. 2624-2628 [doi]
- A Robust Object Segmentation Network for UnderWater ScenesRuizhe Chen, Zhenqi Fu, Yue Huang 0001, En Cheng, Xinghao Ding. 2629-2633 [doi]
- A Fast and Efficient Network for Single Image Shadow DetectionLeiping Jie, Hui Zhang. 2634-2638 [doi]
- Importance Sampling Cams For Weakly-Supervised SegmentationArvi Jonnarth, Michael Felsberg. 2639-2643 [doi]
- DeepGBASS: Deep Guided Boundary-Aware Semantic SegmentationQingfeng Liu, Hai Su, Mostafa El-Khamy, Kee-Bong Song. 2644-2648 [doi]
- Camera Calibration Through Camera Projection LossTalha Hanif Butt, Murtaza Taj. 2649-2653 [doi]
- Inferring Camera Intrinsics Based on Surfaces of Revolution: A Single Image Geometric Network Approach for Camera CalibrationChristopher Walker, Yuxing Wang, Yawen Lu, Guoyu Lu. 2654-2658 [doi]
- Text2video: Text-Driven Talking-Head Video Synthesis with Personalized Phoneme - Pose DictionarySibo Zhang, Jiahong Yuan, Miao Liao, Liangjun Zhang. 2659-2663 [doi]
- Towards Accurate Cross-Domain in-Bed Human Pose EstimationMohamed Afham, Udith Haputhanthri, Jathurshan Pradeepkumar, Mithunjha Anandakumar, Ashwin De Silva, Chamira U. S. Edussooriya. 2664-2668 [doi]
- Learning Monocular Mesh Recovery of Multiple Body Parts Via SynthesisYu Sun, Tianyu Huang, Qian Bao, Wu Liu, Wenpeng Gao, Yili Fu. 2669-2673 [doi]
- LightPose: A Lightweight and Efficient Model with Transformer for Human Pose EstimationXiyang Liu, Peng Li, Ding Ni, Yan Wang, Hui Xue. 2674-2678 [doi]
- On The Observability in Visual Slam NetworksQier An, Yuan Shen. 2679-2683 [doi]
- Variational Bayesian Framework for Advanced Image Generation with Domain-Related VariablesYuxiao Li, Santiago Mazuelas, Yuan Shen. 2684-2688 [doi]
- The Impact of JPEG Compression on Prior Image NoiseMarina Gardella, Tina Nikoukhah, Yanhao Li, Quentin Bammey. 2689-2693 [doi]
- On the Use of Component Structural Characteristics for Voxel Segmentation in Semicon 3D ImagesTin Lay Nwe, Ramanpreet Singh Pahwa, Richard Chang 0002, Oo Zaw Min, Jie Wang 0042, Yiqun Li, Dongyun Lin, Shitala Prasad, Sheng Dong. 2694-2698 [doi]
- Blind Source Separation via a Weak Exclusion PrincipleZihan Zhang, Thierry Blu. 2699-2703 [doi]
- Graph Convolution for Re-Ranking in Person Re-IdentificationYuqi Zhang, Qi Qian 0001, Chong Liu 0002, Weihua Chen, Fan Wang, Hao Li, Rong Jin 0001. 2704-2708 [doi]
- Multi-Level Relation Aware Network for Person Re-IdentificationJing Yang, Canlong Zhang, Zhixin Li, Yanping Tang. 2709-2713 [doi]
- Progressive-Granularity Retrieval Via Hierarchical Feature Alignment for Person Re-IdentificationZhaopeng Dou, Zhongdao Wang, Yali Li, Shengjin Wang. 2714-2718 [doi]
- Occluded Person Re-Identification Via Relational Adaptive Feature Correction LearningMinjung Kim, MyeongAh Cho, Heansung Lee, Suhwan Cho, Sangyoun Lee. 2719-2723 [doi]
- Learning Semantic-Aligned Feature Representation for Text-Based Person SearchShiping Li, Min Cao, Min Zhang. 2724-2728 [doi]
- Transformer-Based Person Search Model with Symmetric Online Instance MatchingXuezhi Xiang, Ning Lv, Yulong Qiao. 2729-2733 [doi]
- Wassertrain: An Adversarial Training Framework Against Wasserstein Adversarial AttacksQingye Zhao, Xin Chen 0027, Zhuoyu Zhao, Enyi Tang, Xuandong Li. 2734-2738 [doi]
- Efficient Universal Shuffle Attack for Visual Object TrackingSiao Liu, Zhaoyu Chen, Wei Li, Jiwei Zhu, Jiafeng Wang, Wenqiang Zhang, Zhongxue Gan. 2739-2743 [doi]
- Non-Rigid Transformation Based Adversarial Attack Against 3d Object TrackingRiran Cheng, Nan Sang, Yinyuan Zhou, Xupeng Wang. 2744-2748 [doi]
- Adversary Distillation for One-Shot Attacks on 3D Target TrackingZhengyi Wang, Xupeng Wang, Ferdous Sohel, Mohammed Bennamoun, Yong Liao, Jiali Yu. 2749-2753 [doi]
- AdverFacial: Privacy-Preserving Universal Adversarial Perturbation Against Facial Micro-Expression LeakagesYin Yin Low, Angeline Tanvy, Raphaël C.-W. Phan, Xiaojun Chang. 2754-2758 [doi]
- Interpretable Image Classification Using Sparse Oblique Decision TreesSuryabhan Singh Hada, Miguel Á. Carreira-Perpiñán. 2759-2763 [doi]
- Underwater Image Enhancement Via Learning Water Type Desensitized RepresentationsZhenqi Fu, Xiaopeng Lin, Wu Wang, Yue Huang 0001, Xinghao Ding. 2764-2768 [doi]
- A Wavelet-Based Dual-Stream Network for Underwater Image EnhancementZiyin Ma, Changjae Oh. 2769-2773 [doi]
- Unsupervised and Untrained Underwater Image Restoration Based on Physical Image Formation ModelShu Chai, Zhenqi Fu, Yue Huang 0001, Xiaotong Tu, Xinghao Ding. 2774-2778 [doi]
- Agcyclegan: Attention-Guided Cyclegan for Single Underwater Image RestorationZhenlong Wang, Weifeng Liu 0001, Yanjiang Wang 0001, Baodi Liu. 2779-2783 [doi]