Multi-timescale Feature-extraction Architecture of Deep Neural Networks for Acoustic Model Training from Raw Speech Signal

Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani. Multi-timescale Feature-extraction Architecture of Deep Neural Networks for Acoustic Model Training from Raw Speech Signal. In 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2018, Madrid, Spain, October 1-5, 2018, pages 2503-2510. IEEE, 2018. doi:10.1109/IROS.2018.8593925

@inproceedings{TakedaNK18,
  title = {Multi-timescale Feature-extraction Architecture of Deep Neural Networks for Acoustic Model Training from Raw Speech Signal},
  author = {Ryu Takeda and Kazuhiro Nakadai and Kazunori Komatani},
  year = {2018},
  doi = {10.1109/IROS.2018.8593925},
  url = {https://doi.org/10.1109/IROS.2018.8593925},
  researchr = {https://researchr.org/publication/TakedaNK18},
  cites = {0},
  citedby = {0},
  pages = {2503--2510},
  booktitle = {2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2018, Madrid, Spain, October 1-5, 2018},
  publisher = {IEEE},
  isbn = {978-1-5386-8094-0},
}