kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Full-Glow: Fully conditional Glow for more realistic image generation
KTH, School of Electrical Engineering and Computer Science (EECS), Computer Science, Computational Science and Technology (CST).ORCID iD: 0000-0001-6204-0778
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH.ORCID iD: 0000-0002-1643-1054
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Robotics, Perception and Learning, RPL.ORCID iD: 0000-0002-5750-9655
2021 (English)In: Pattern Recognition: 43rd DAGM German Conference, DAGM GCPR 2021 / [ed] Bauckhage, C., Gall, J., Schwing, A., Cham, Switzerland: Springer Nature , 2021, Vol. 13024, p. 697-711Conference paper, Published paper (Refereed)
Abstract [en]

Autonomous agents, such as driverless cars, require large amounts of labeled visual data for their training. A viable approach for acquiring such data is training a generative model with collected real data, and then augmenting the collected real dataset with synthetic images from the model, generated with control of the scene layout and ground truth labeling. In this paper we propose Full-Glow, a fully conditional Glow-based architecture for generating plausible and realistic images of novel street scenes given a semantic segmentation map indicating the scene layout. Benchmark comparisons show our model to outperform recent works in terms of the semantic segmentation performance of a pretrained PSPNet. This indicates that images from our model are, to a higher degree than from other models, similar to real images of the same kinds of scenes and objects, making them suitable as training data for a visual semantic segmentation or object recognition system.

Place, publisher, year, edition, pages
Cham, Switzerland: Springer Nature , 2021. Vol. 13024, p. 697-711
Series
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), ISSN 0302-9743 ; 13024
Keywords [en]
Conditional image generation, generative models, normalizing flows
National Category
Computer and Information Sciences
Research subject
Computer Science
Identifiers
URN: urn:nbn:se:kth:diva-312897DOI: 10.1007/978-3-030-92659-5_45Scopus ID: 2-s2.0-85124290582OAI: oai:DiVA.org:kth-312897DiVA, id: diva2:1660588
Conference
43rd DAGM German Conference on Pattern Recognition, DAGM GCPR 2021, Virtual, Online, 28 September 2021 through 1 October 2021
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)
Note

QC 20220530

Part of proceedings: ISBN 978-303092658-8

Available from: 2022-05-24 Created: 2022-05-24 Last updated: 2024-01-18Bibliographically approved

Open Access in DiVA

fulltext(16906 kB)183 downloads
File information
File name FULLTEXT01.pdfFile size 16906 kBChecksum SHA-512
826b7da4750247f2dcf05ae54979eaab98ce659fce983994f9171e91e526186453051a0096a85d9ccc3870b2beb228b73c0584ac06eb59d85973cea0b2cdaf16
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Sorkhei, Mohammad MoeinHenter, Gustav EjeKjellström, Hedvig

Search in DiVA

By author/editor
Sorkhei, Mohammad MoeinHenter, Gustav EjeKjellström, Hedvig
By organisation
Computational Science and Technology (CST)Speech, Music and Hearing, TMHRobotics, Perception and Learning, RPL
Computer and Information Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 183 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 239 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf