Dataset: Yago [link]
Dataset: Douban movie, Douban book, Yelp [download]
Douban book: The data set in book domain comprises 792,062 ratings (scales 1–5) by 13,024 users on 22,347 books.
Dataset: Douban book, Dianping [download]
Douban book: The Douban Book dataset contains 190,590 ratings (1-5 scales) involving 12,850 users and 22,040 books.
Dianping: The Dianping dataset contains 188,813 ratings (1-5 scales) involving 10,549 users and 17,707 restaurants.
Dataset: Yago [link]
Yago: Yago is a huge semantic knowledge graph derived from Wikipedia, WordNet and GeoNames. Currently, it has knowledge about more than 10 million entities and contains more than 120 million facts. We adopt "yagoFacts", "yagoSimpleTypes" and "yagoTaxonomy" parts of this dataset to conduct experiments, which contain 35 relationships, more than 1.3 million entities of 3,455 instance classes.
Dataset: DBLP, ACM, IMDB [link]
Dataset: Douban movie, MovieLens, Yelp challenge, Douban book
Dataset: TripAdvisor, Dianping [download]
TripAdvisor: The TripAdvisor dataset contains 162,595 ratings on 79013 users on 5530 hotels.
Dianping: The Dianping dataset contains 216,291 ratings by 14,022 users on 1097 restaurants.
Dataset: Yago [link]
Yago: Yalgo is a large-scale Knowledge Graph, which derived from Wikipedia, WordNet and GeoNames. The dataset includes more than ten million entities and 120 million facts made from these entities. We only adopt “COREFact" of this dataset, which contains 4,484,914 facts, 35 relationships and 1,369,931 entities of 3,455 types.
Dataset: Douban movie, Yelp [download]
Douban movie: Douban is a well known social media network in China. The dataset includes 3,022 users and 6,971 movies with 195,493 ratings ranging from 1 to 5.
Yelp: Yelp is a famous user review website in America. The dataset includes 14,085 users and 14,037 movies with 194,255 ratings ranging from 1 to 5.
Dataset: Douban movie, Yelp [download]
Douban movie: Douban dataset includes 13,367 users and 12,677 movies with 1068278 movie ratings ranging from 1 to 5.
Yelp: Yelp dataset contains user ratings on local business and attribute information of users and businesses. The dataset includes 16239 users and 14284 local businesses with 198397 ratings from 1 to 5.
Dataset: ACM, DBLP [download]
ACM: The data set has 12K papers, 17K authors, and 1.8K author affiliations.
DBLP: The dataset contains 14K papers, 20 conferences, 14K authors and 8.9K terms, with a total number of 17K links. In the data set, 4,057 authors, all 20 conferences and 100 papers are labeled with one of the four research areas.
Dataset: Movielens, LastFM, Yelp, Amazon [download]
LastFM: This dataset contains social networking, bookmarking, and tagging information from a set of 2K users from Delicious social bookmarking system.
Amazon: This dataset contains product reviews and metadata from Amazon, including 142.8 million reviews spanning May 1996 - July 2014, which includes reviews (ratings, text, helpfulness votes), product metadata (descriptions, category information, price, brand, and image features), and links (also viewed/also bought graphs).