Yih-Ling Hedley, Muhammad Younas, Anne E. James, Mark Sanderson. A Two-Phase Sampling Technique to Improve the Accuracy of Text Similarities in the Categorisation of Hidden Web Databases. In Xiaofang Zhou, Stanley Y. W. Su, Mike P. Papazoglou, Maria E. Orlowska, Keith G. Jeffery, editors, Web Information Systems - WISE 2004, 5th International Conference on Web Information Systems Engineering, Brisbane, Australia, November 22-24, 2004, Proceedings. Volume 3306 of Lecture Notes in Computer Science, pages 516-527, Springer, 2004. [doi]