SMILES Pair Encoding: A Data-Driven Substructure Tokenization Algorithm for Deep Learning

Xinhao Li, Denis Fourches. SMILES Pair Encoding: A Data-Driven Substructure Tokenization Algorithm for Deep Learning. Journal of Chemical Information and Computer Sciences, 61(4):1560-1569, 2021. [doi]

@article{LiF21-6,
  title = {SMILES Pair Encoding: A Data-Driven Substructure Tokenization Algorithm for Deep Learning},
  author = {Xinhao Li and Denis Fourches},
  year = {2021},
  doi = {10.1021/acs.jcim.0c01127},
  url = {https://doi.org/10.1021/acs.jcim.0c01127},
  researchr = {https://researchr.org/publication/LiF21-6},
  cites = {0},
  citedby = {0},
  journal = {Journal of Chemical Information and Computer Sciences},
  volume = {61},
  number = {4},
  pages = {1560-1569},
}