| Corpus | Search Engine Click-through Log(SogouQ) |
|---|---|
| Version | 2008 |
| Introduction | SogouQ contains search engine click-through log data collected by sogou.com in June 2008. Privacy information has been removed as well as porn-related/illegal query sessions. |
| The corpus records the following information (seperated by "\t"): | Time of the click event user ID(automatically assigned by system) [user query] Ranking of the clicked URL Ordering of user click the clicked URL |
| Adopted By | NTCIR 2011 intent task CLEF LogCLEF 2011 task |
| Related Resources |
SogouT Web corpus Search performance evaluation benchmark for SogouT Hyperlink structure data for SogouT PageRank scores for SogouT |
| Realted Publications | 1.Predicting Epidemic Tendency through Search Behavior Analysis. Danqing Xu, Yiqun Liu, Min Zhang, Liyun Ru, Shaoping Ma. In Proceedings of the 22nd International Joint Conference on Artificial Intelligence (IJCAI-11) (Barcelona, Spain). 2.How do users describe their information need: Query recommendation based on snippet click model Yiqun Liu, Junwei Miao, Min Zhang, Shaoping Ma, Liyun Ru. Expert Systems With Applications. 38(11): 13847-13856, 2011. 3. Automatic Search Engine Performance Evaluation with Click-through Data Analysis. Yiqun Liu, Yupeng Fu, Min Zhang, Shaoping Ma, Liyun Ru, Poster proceedings of the 16th International World Wide Web Conference (WWW07), 2007, Banff, Alberta, Canada. 4.Incorporating Web Browsing Information into Anchor Texts for Web Search Bo Zhou, Yiqun Liu, Min Zhang, Yijiang Jin, Shaoping Ma. Information Retrieval Volume 14, Issue 3: 290-314, 2011. |
| Download | Please read the "License for Use of Sogou Lab Data" carefully before downloading. Mini Sample(376KB): gzip compressed, zip compressed Reduced Sample(One day, 63MB): gzip compressed, zip compressed Completed data(1.9GB): gzip compressed, zip compressed |