kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Mining of User Profiles in Online Social Networks for Improved Personalized Recommendations
KTH, School of Electrical Engineering and Computer Science (EECS), Computer Science, Software and Computer systems, SCS.ORCID iD: 0000-0002-7786-9551
2020 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

We have focused on influencer-based marketing in online social networks as a source of implicit learning about the preferences of social media users. Those users who use social networks on a daily basis are also the online shoppers who are confronted with huge information overload and a wide variety of online products and brands to choose from. The role of digital influencers in promoting products and spreading information to a large scale of followers who engage with the influencers’ posts and interact with them is our key to better understanding of these followers’ tastes and future purchase intentions. Hence, the analysis and the extraction of fine-grained details (which we refer to as user profiling) from digital influencers media content serves in collecting more information about the implicit preferences of their followers. With this knowledge, the chances of offering social media users better personalized services are enhanced. In this thesis, we empower cross-domain recommendations through the development of novel methods and algorithms for improving personalization through the effective mining of user profiles in online social networks. We developed a semantic information extraction framework from social media textual content that is able to capture fine-grained attributes with respect to the defined online shops taxonomy. Results form the aforementioned framework have been applied as input to the approaches we proposed to incorporate extracted textual hints in supporting the visual fine-grained classification of social media images in a dynamic way. Our methods have improved the classification accuracy when compared to state-of-the-art approaches. Moreover, we suggested solutions for incorporating the extracted products’ meta-data in embedding-based personalized recommendation architectures where our strategies improved the recommendations’ quality. In order to speed up the process of preparing large scale social media images datasets for deep learning image analysis, we developed a complete framework for detailed annotation, object localization and semantic segmentation. As our focus is also directed towards the analysis of interactions between social media users, we proposed a neural reinforcement learning approach that is based on estimating the established trust levels between social media users for controlling the amount of recommended updates they get from each other. Moreover, we proposed enhanced topic modelling algorithm for supporting interpretable yet dynamic summarizations of large social media contents.

Abstract [sv]

Vi har fokuserat på influenserbaserad marknadsföring i sociala nätverk online som en källa till implicit lärande om sociala medianvändares preferenser. De användare som använder sociala nätverk dagligen är också online-shoppare som står inför enorm informationsöverbelastning och ett brett utbud av onlineprodukter och varumärken att välja mellan. Rollen hos digitala influenser när det gäller att marknadsföra produkter och sprida information till en stor skala av anhängare som engagerar sig i influencers inlägg och interagerar med dem är vår nyckel till bättre förståelse för dessa anhängares smak och framtida köpintentioner. Analysen och utvinningen av finkorniga detaljer (som vi kallar textit user profiling) från medieinnehåll för digitala influenser tjänar därför till att samla in mer information om deras implicita preferenser. Med denna kunskap tillämpad för att berika användarprofiler för sociala medier förbättras chanserna att erbjuda dem bättre anpassade tjänster. I denna avhandling ger vi rekommendationer över gränserna genom utveckling av nya metoder och algoritmer för att förbättra personalisering genom effektiv utvinning av användarprofiler i sociala nätverk online. Vi utvecklade en semantisk ram för informationsextraktion från textinnehåll i sociala medier som kan fånga finkorniga attribut med avseende på den definierade onlinebutikens taxonomi. Resultat från ovannämnda ramverk har använts som input till de tillvägagångssätt som vi föreslog för att införliva extraherade texttips för att stödja den visuella finkorniga klassificeringen av sociala mediebilder på ett dynamiskt sätt. Våra metoder har förbättrat klassificeringsnoggrannheten jämfört med toppmoderna metoder. Dessutom föreslog vi lösningar för att integrera de extraherade produkternas metadata i inbäddningsbaserade personliga rekommendationsarkitekturer där våra strategier förbättrade rekommendationernas kvalitet. För att påskynda processen att förbereda storskaliga bildmängder för sociala medier för djupinlärningsbildanalys utvecklade vi en komplett ram för detaljerad kommentar, objektlokalisering och semantisk segmentering. Eftersom vårt fokus också riktas mot analysen av interaktioner mellan användare av sociala medier, föreslog vi en neurologisk förstärkningsinlärningsmetod som baseras på att uppskatta de etablerade tillitsnivåerna mellan användare av sociala medier för att kontrollera mängden rekommenderade uppdateringar de får från varandra. Dessutom föreslog vi förbättrad ämnesmodelleringsalgoritm för att stödja tolkbara men dynamiska sammanfattningar av stora sociala medieinnehåll.

Place, publisher, year, edition, pages
Stockholm: KTH Royal Institute of Technology, 2020. , p. 133
Series
TRITA-EECS-AVL ; 2020:59
National Category
Computer Systems
Identifiers
URN: urn:nbn:se:kth:diva-285522ISBN: 978-91-7873-688-1 (print)OAI: oai:DiVA.org:kth-285522DiVA, id: diva2:1499132
Public defence
2020-12-03, Sal C, Kistagången 16, Kista, Stockholm, 16:00 (English)
Opponent
Supervisors
Note

QC 20201106

Available from: 2020-11-06 Created: 2020-11-06 Last updated: 2022-06-25Bibliographically approved
List of papers
1. OLLDA: A Supervised and Dynamic Topic Mining Framework in Twitter
Open this publication in new window or tab >>OLLDA: A Supervised and Dynamic Topic Mining Framework in Twitter
2015 (English)In: 2015 IEEE International Conference on Data Mining Workshop (ICDMW), 2015, p. 1354-1359Conference paper, Published paper (Refereed)
Abstract [en]

Analyzing media in real-time is of great importance with social media platforms at the epicenter of crunching, digesting and disseminating content to individuals connected to these platforms. Within this context, topic models, specially LDA, have gained strong momentum due to their scalability, inference power and their compact semantics. Although, state of the art topic models come short in handling streaming large chunks of data arriving dynamically onto the platform, thus hindering their quality of interpretation as well as their adaptability to information overload. As a result, in this manuscript we propose for a labelled and online extension to LDA (OLLDA), which incorporates supervision through external labeling and capability of quickly digesting real-time updates thus making it more adaptive to Twitter and platforms alike. Our proposed extension has capability of handling large quantities of newly arrived documents in a stream, and at the same time, is capable of achieving high topic inference quality given the short and often sloppy text of tweets. Our approach mainly uses an approximate inference technique based on variational inference coupled with a labeled LDA model. We conclude by presenting experiments using a one year crawl of Twitter data that shows significantly improved topical inference as well as temporal user profile classification when compared to state of the art baselines.

National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:kth:diva-192057 (URN)10.1109/ICDMW.2015.132 (DOI)000380556700183 ()2-s2.0-84964797270 (Scopus ID)978-1-4673-8493-3 (ISBN)
External cooperation:
Conference
IEEE 15th International Conference on Data Mining Workshops (ICDMW), NOV 14-17, 2015, ATlantic city, NJ
Note

QC 20160906

Available from: 2016-09-06 Created: 2016-09-05 Last updated: 2024-03-18Bibliographically approved
2. Deep Text Mining of Instagram Data Without Strong Supervision
Open this publication in new window or tab >>Deep Text Mining of Instagram Data Without Strong Supervision
2018 (English)In: Proceedings - 2018 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2018, IEEE, 2018, p. 158-165Conference paper, Published paper (Refereed)
Abstract [en]

With the advent of social media, our online feeds increasingly consist of short, informal, and unstructured text. This textual data can be analyzed for the purpose of improving user recommendations and detecting trends. Instagram is one of the largest social media platforms, containing both text and images. However, most of the prior research on text processing in social media is focused on analyzing Twitter data, and little attention has been paid to text mining of Instagram data. Moreover, many text mining methods rely on annotated training data, which in practice is both difficult and expensive to obtain. In this paper, we present methods for unsupervised mining of fashion attributes from Instagram text, which can enable a new kind of user recommendation in the fashion domain. In this context, we analyze a corpora of Instagram posts from the fashion domain, introduce a system for extracting fashion attributes from Instagram, and train a deep clothing classifier with weak supervision to classify Instagram posts based on the associated text. With our experiments, we confirm that word embeddings are a useful asset for information extraction. Experimental results show that information extraction using word embeddings outperforms a baseline that uses Levenshtein distance. The results also show the benefit of combining weak supervision signals using generative models instead of majority voting. Using weak supervision and generative modeling, an F1 score of 0.61 is achieved on the task of classifying the image contents of Instagram posts based solely on the associated text, which is on level with human performance. Finally, our empirical study provides one of the few available studies on Instagram text and shows that the text is noisy, that the text distribution exhibits the long-tail phenomenon, and that comment sections on Instagram are multi-lingual.

Place, publisher, year, edition, pages
IEEE, 2018
Keywords
Information extraction, Instagram, Weak Supervision, Word Embeddings
National Category
Natural Language Processing
Identifiers
urn:nbn:se:kth:diva-245979 (URN)10.1109/WI.2018.00-94 (DOI)000458968200021 ()2-s2.0-85061892408 (Scopus ID)9781538673256 (ISBN)
Conference
18th IEEE/WIC/ACM International Conference on Web Intelligence, WI 2018; Santiago; Chile; 3 December 2018 through 6 December 2018
Note

QC 20190313

Available from: 2019-03-13 Created: 2019-03-13 Last updated: 2025-02-07Bibliographically approved
3. Outfit2Vec: Incorporating Clothing Hierarchical MetaData into Outfits’ Recommendation
Open this publication in new window or tab >>Outfit2Vec: Incorporating Clothing Hierarchical MetaData into Outfits’ Recommendation
2020 (English)In: Lecture Notes in Social Networks book series (LNSN): Fashion Recommender Systems / [ed] Springer, Springer: Springer Nature , 2020Chapter in book (Other academic)
Place, publisher, year, edition, pages
Springer: Springer Nature, 2020
National Category
Computer Systems
Identifiers
urn:nbn:se:kth:diva-285555 (URN)10.1007/978-3-030-55218-3_5 (DOI)
Note

QC 20201130

Available from: 2020-11-05 Created: 2020-11-05 Last updated: 2023-03-30Bibliographically approved
4. Learning What to Share in Online Social Networks Using Deep Reinforcement Learning
Open this publication in new window or tab >>Learning What to Share in Online Social Networks Using Deep Reinforcement Learning
2018 (English)In: Machine Learning Techniques for Online Social Networks, Springer International Publishing , 2018, p. 115-133Chapter in book (Other academic)
Abstract [en]

Online networking sites tried their best to have right privacy mechanisms in place for users, enabling them to share the right content with the right audience. With all these efforts, privacy customizations remain hard for users across the sites. Existing research that addresses this problem mainly focuses on semi-supervised strategies that introduce extra complexity by requiring the user to manually specify initial privacy preferences for their friends. In this work, we suggest a deep reinforcement learning framework that can dynamically generate privacy labels for users in OSNs. We evaluated our framework on a 1 year crawl of Twitter data, using different types of recurrent units in recurrent neural networks (RNN): Long Short Term Memory (LSTM), Gated Recurrent Unit (GRU), and Simple RNN. Our experiments revealed that LSTM performed better than GRU in terms of top users detection accuracy and the ranked dependence between the generated privacy labels and estimated user trust values.

Place, publisher, year, edition, pages
Springer International Publishing, 2018
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-285552 (URN)10.1007/978-3-319-89932-9_6 (DOI)
Note

QC 20201105

Available from: 2020-11-05 Created: 2020-11-05 Last updated: 2022-10-24Bibliographically approved
5. Deep cross-domain fashion recommendation
Open this publication in new window or tab >>Deep cross-domain fashion recommendation
2017 (English)In: RecSys 2017 - Proceedings of the 11th ACM Conference on Recommender Systems, Association for Computing Machinery (ACM), 2017, p. 407-410Conference paper, Published paper (Refereed)
Abstract [en]

With the increasing number of online shopping services, the number of users and the quantity of visual and textual information on the Internet, there is a pressing need for intelligent recommendation systems that analyze the user's behavior amongst multiple domains and help them to find the desirable information without the burden of search. However, there is little research that has been done on complex recommendation scenarios that involve knowledge transfer across multiple domains. This problem is especially challenging when the involved data sources are complex in terms of the limitations on the quantity and quality of data that can be crawled. The goal of this paper is studying the connection between visual and textual inputs for better analysis of a certain domain, and to examine the possibility of knowledge transfer from complex domains for the purpose of efficient recommendations. The methods employed to achieve this study include both design of architecture and algorithms using deep learning technologies to analyze the effect of deep pixel-wise semantic segmentation and text integration on the quality of recommendations. We plan to develop a practical testing environment in a fashion domain.

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2017
Keywords
CNN, Cross-domain knowledge transfer, Deep learning, Domain adaptation, Fashion recommendation, Transfer learning
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-218546 (URN)10.1145/3109859.3109861 (DOI)000426967000089 ()2-s2.0-85030469064 (Scopus ID)
Conference
11th ACM Conference on Recommender Systems, RecSys 2017, Como, Italy, 27 August 2017 through 31 August 2017
Note

QC 20241106

Part of ISBN 978-145034652-8

Available from: 2017-11-30 Created: 2017-11-30 Last updated: 2024-11-06Bibliographically approved
6. TALS: A framework for text analysis, fine-grained annotation, localisation and semantic segmentation
Open this publication in new window or tab >>TALS: A framework for text analysis, fine-grained annotation, localisation and semantic segmentation
Show others...
2019 (English)In: Proceedings - International Computer Software and Applications Conference, IEEE Computer Society, 2019, Vol. 8754470, p. 201-206Conference paper, Published paper (Refereed)
Abstract [en]

With around 2.77 billion users using online social media platforms nowadays, it is becoming more attractive for business retailers to reach and to connect to more potential clients through social media. However, providing more effective recommendations to grab clients’ attention requires a deep understanding of users’ interests. Given the enormous amounts of text and images that users share in social media, deep learning approaches play a major role in performing semantic analysis of text and images. Moreover, object localisation and pixel-by-pixel semantic segmentation image analysis neural architectures provide an enhanced level of information. However, to train such architectures in an end-to-end manner, detailed datasets with specific meta-data are required. In our paper, we present a complete framework that can be used to tag images in a hierarchical fashion, and to perform object localisation and semantic segmentation. In addition to this, we show the value of using neural word embeddings in providing additional semantic details to annotators to guide them in annotating images in the system. Our framework is designed to be a fully functional solution capable of providing fine-grained annotations, essential localisation and segmentation services while keeping the core architecture simple and extensible. We also provide a fine-grained labelled fashion dataset that can be a rich source for research purposes.

Place, publisher, year, edition, pages
IEEE Computer Society, 2019
Series
Proceedings - International Computer Software and Applications Conference, ISSN 0730-3157
Keywords
Annotations, Dataset, Deep learning, Fine-grained, Localisation, Natural language processing, Semantic segmentation, Word embeddings
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:kth:diva-262582 (URN)10.1109/COMPSAC.2019.10207 (DOI)000538781300034 ()2-s2.0-85072655269 (Scopus ID)
Conference
43rd IEEE Annual Computer Software and Applications Conference, COMPSAC 2019; Milwaukee; United States; 15 July 2019 through 19 July 2019
Note

QC 20191022

Part of ISBN 9781728126074

Available from: 2019-10-22 Created: 2019-10-22 Last updated: 2024-10-21Bibliographically approved
7. Dynamic CNN Models For Fashion Recommendation in Instagram
Open this publication in new window or tab >>Dynamic CNN Models For Fashion Recommendation in Instagram
Show others...
2018 (English)In: 2018 IEEE INT CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, UBIQUITOUS COMPUTING & COMMUNICATIONS, BIG DATA & CLOUD COMPUTING, SOCIAL COMPUTING & NETWORKING, SUSTAINABLE COMPUTING & COMMUNICATIONS / [ed] Chen, JJ Yang, LT, IEEE COMPUTER SOC , 2018, p. 1144-1151Conference paper, Published paper (Refereed)
Abstract [en]

Instagram as an online photo-sharing and social-networking service is becoming more powerful in enabling fashion brands to ramp up their business growth. Nowadays, a single post by a fashion influencer attracts a wealth of attention and a magnitude of followers who are curious to know more about the brands and style of each clothing item sitting inside the image. To this end, the development of efficient Deep CNN models that can accurately detect styles and brands have become a research challenge. In addition, current techniques need to cope with inherent fashion-related data issues. Namely, clothing details inside a single image only cover a small proportion of the large and hierarchical space of possible brands and clothing item attributes. In order to cope with these challenges, one can argue that neural classifiers should become adapted to large-scale and hierarchical fashion datasets. As a remedy, we propose two novel techniques to incorporate the valuable social media textual content to support the visual classification in a dynamic way. The first method is adaptive neural pruning (DynamicPruning) in which the clothing item category detected from posts' text analysis can be used to activate the possible range of connections of clothing attributes' classifier. The second method (DynamicLayers) is a dynamic framework in which multiple-attributes classification layers exist and a suitable attributes' classifier layer is activated dynamically based upon the mined text from the image. Extensive experiments on a dataset gathered from Instagram and a baseline fashion dataset (DeepFashion) have demonstrated that our approaches can improve the accuracy by about 20% when compared to base architectures. It is worth highlighting that with Dynamiclayers we have gained 35% accuracy for the task of multi-class multi-labeled classification compared to the other model.

Place, publisher, year, edition, pages
IEEE COMPUTER SOC, 2018
Series
IEEE International Symposium on Parallel and Distributed Processing with Applications, ISSN 2158-9178
Keywords
Neural Pruning, Dynamic Computation Graph, Dynamic CNN, Image Classification, Text Mining, Fashion Recommendation
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:kth:diva-252671 (URN)10.1109/BDCloud.2018.00169 (DOI)000467843200155 ()2-s2.0-85063866491 (Scopus ID)978-1-7281-1141-4 (ISBN)
Conference
16th IEEE ISPA / 17th IEEE IUCC / 8th IEEE BDCloud / 11th IEEE SocialCom / 8th IEEE SustainCom, DEC 11-13, 2018, Melbourne, AUSTRALIA
Note

QC 20190603

Available from: 2019-06-03 Created: 2019-06-03 Last updated: 2024-03-18Bibliographically approved

Open Access in DiVA

Shatha_Jaradat_Doctoral_Thesis(2951 kB)1637 downloads
File information
File name FULLTEXT01.pdfFile size 2951 kBChecksum SHA-512
22282c37dcdca4acef75feec9dbf0552706d0f34e49b2fdecb97e690335c03f1ce4f02700a9dcd94b8f17be547077cb75dd7fafeba9bedbe2d6fcb56a21e6612
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Jaradat, Shatha
By organisation
Software and Computer systems, SCS
Computer Systems

Search outside of DiVA

GoogleGoogle Scholar
Total: 1638 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 1468 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf