Pix2seq: A Language Modeling Framework for Object Detection

Ting Chen, Saurabh Saxena, Lala Li, David J. Fleet, Geoffrey E. Hinton. Pix2seq: A Language Modeling Framework for Object Detection. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net, 2022.
