Topology, Big Data and Optimization
2016 (English)In: Big Data Optimization: Recent Developments and Challenges, Springer Science and Business Media Deutschland GmbH , 2016, p. 147-176Chapter in book (Refereed)
Abstract [en]
The idea of using geometry in learning and inference has a long history going back to canonical ideas such as Fisher information, Discriminant analysis, and Principal component analysis. The related area of Topological Data Analysis (TDA) has been developing in the last decade. The idea is to extract robust topological features from data and use these summaries for modeling the data. A topological summary generates a coordinate-free, deformation invariant and highly compressed description of the geometry of an arbitrary data set. Topological techniques are well-suited to extend our understanding of Big Data. These tools do not supplant existing techniques, but rather provide a complementary viewpoint to existing techniques. The qualitative nature of topological features do not give particular importance to individual samples, and the coordinate-free nature of topology generates algorithms and viewpoints well suited to highly complex datasets. With the introduction of persistence and other geometric-topological ideas we can find and quantify local-to-global properties as well as quantifying qualitative changes in data.
Place, publisher, year, edition, pages
Springer Science and Business Media Deutschland GmbH , 2016. p. 147-176
Series
Studies in Big Data, ISSN 2197-6503, E-ISSN 2197-6511 ; 18
Keywords [en]
Curve, Euler characteristic, Mapper, Persistent homology, Topological Data Analysis Applied topology, Big data, Curve fitting, Data handling, Data mining, Discriminant analysis, Fisher information matrix, Geometry, Principal component analysis, Coordinate-free, Fisher information, Optimisations, Topological data analyse applied topology, Topological data analysis, Topological features, Topology
National Category
Information Systems
Identifiers
URN: urn:nbn:se:kth:diva-314416DOI: 10.1007/978-3-319-30265-2_7Scopus ID: 2-s2.0-85028322938OAI: oai:DiVA.org:kth-314416DiVA, id: diva2:1672967
Note
QC 20220620
Part of book ISBN 978-3-319-30265-2
2022-06-202022-06-202022-06-25Bibliographically approved