SEMAUTO-2.0

Semantics-Aware Autoencoder Neural Network implemented in TensorFlow.

SEMAUTO is a Semantics-Aware Autoencoder Neural Network which exploits knowledge encoded in a Knowledge Graph in order to build an Autoencoder whose topology reflects connections in the KG to weigh features related to rated items to build an explicit representation of the user profile.

In the last years, deep learning has shown to be a game-changing technology in artificial intelligence thanks to the numerous successes it reached in diverse application fields. Among others, the use of deep learning for the recommendation problem, although new, looks quite promising due to its positive performances in terms of accuracy of recommendation results. In a recommendation setting, in order to predict user ratings on unknown items a possible configuration of a deep neural network is that of autoencoders tipically used to produce a lower dimensionality representation of the original data. SEMAUTO is an autoencoder that bases the structure of its neural network on the semantics-aware topology of a knowledge graph thus providing an explicit label for neurons in the hidden layer that are eventually used to build a user profile and then compute recommendations. It has been shown that SEMAUTO outperforms other state of the art algorithms in terms of accuracy, diversity and novelty of recommended results.

The evaluation has been performed by adopting the RankSys Java library.

Reference

If you publish research that uses SEMAUTO-2.0, please cite it as

@InProceedings{DOTD16, 
 author = {Vito Bellini, Angelo Schiavone, Tommaso {Di Noia}, Azzurra Ragone, Eugenio {Di Sciascio}},
 title = {Computing recommendations via a Knowledge Graph-aware Autoencoder},
 booktitle = {Proceedings of the RecSys 2018 Workshop on Knowledge-aware and Conversational Recommender 
              Systems (KaRS) co-located with 12th {ACM} Conference on Recommender Systems
              (RecSys 2018), Vancouver, Canada, October 7, 2018.},
 year = {2018}
}

The full paper describing the overall approach is available here PDF

Configuration

All the config params are placed in config.ini file.

Train and test files

[DEFAULT]

# Train file path
training_file = train-1m.tsv

# Test file path
test_file = test-1m.tsv

Knowledge Graph settings

[KG]
# Mapping file
dbpedia_map = dbpediamap.tsv

# SPARQL endpoint
sparql_endpoint = http://dbpedia.org/sparql

# Directory in which to store KG data
directory = KG

# File containing a list of predicates used to explore the KG
predicates_file = predicates

SEMAUTO

[SEMAUTO]
# Directory in which to store matrices used by SEMAUTO
directory = nets

# Directory in which to store users profile
user_profiles_dir = UP

Word2Vec

[W2V]
# Voucaboulary of features
features_file = features_space.txt

# File containing a senteces made of pairs of feature and its weight (ie. feature_name-0.3)
senteces_file = sentences.tsv

# Map of indexed features
features_dict_file = features_map.dict

# Directory in which to save new enhanced users profile
newDir = w2vUP

# Keep users' order of the data listed in sentences_file
users_sentences = user_sentences.tsv

# Word2Vec model params (using gensim framework)
epochs = 50
size = 100
window = 500
min_count = 1

Item-KNN

[VSM]

# List of neighboorhoods size
knn_file = knns.txt

# Recommendation output file
save = predictions

Run

$ run.sh

Credits

This library has been developed by Vito Bellini and Angelo Schiavone while working at SisInf Lab under the supervision of Tommaso Di Noia.

Contacts

Vito Bellini, vito [dot] bellini [at] poliba [dot] it

Tommaso Di Noia, tommaso [dot] dinoia [at] poliba [dot] it

Angelo Schiavone, angelo [dot] schiavone [at] poliba [dot] it

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
paper_results		paper_results
LICENSE		LICENSE
README.md		README.md
config.ini		config.ini
dbpediamap.tsv		dbpediamap.tsv
genre		genre
getKG.py		getKG.py
getSentencesWeighted.py		getSentencesWeighted.py
knns.txt		knns.txt
matrix.py		matrix.py
oracle.jar		oracle.jar
predicates		predicates
run.sh		run.sh
semauto.py		semauto.py
test-1m.tsv		test-1m.tsv
train-1m.tsv		train-1m.tsv
vsm.py		vsm.py
word2vecWeighted.py		word2vecWeighted.py

License

sisinflab/SEMAUTO-2.0

Folders and files

Latest commit

History

Repository files navigation

SEMAUTO-2.0

Semantics-Aware Autoencoder Neural Network implemented in TensorFlow.

Reference

Configuration

Train and test files

Knowledge Graph settings

SEMAUTO

Word2Vec

Item-KNN

Run

Credits

Contacts

About

Resources

License

Stars

Watchers

Forks

Languages