Normal view MARC view ISBD view

Understanding High-Dimensional Spaces [electronic resource] / by David B. Skillicorn.

By: Skillicorn, David B [author.].
Contributor(s): SpringerLink (Online service).
Material type: materialTypeLabelBookSeries: SpringerBriefs in Computer Science: Publisher: Berlin, Heidelberg : Springer Berlin Heidelberg : Imprint: Springer, 2012Description: IX, 108 p. 29 illus. online resource.Content type: text Media type: computer Carrier type: online resourceISBN: 9783642333989.Subject(s): Computer science | Data protection | Data structures (Computer science) | Information systems | Electronic data processing | Computer Science | Information Systems and Communication Service | Data Structures, Cryptology and Information Theory | Computing Methodologies | Systems and Data Security | e-Commerce/e-businessDDC classification: 005.7 Online resources: Click here to access online
Contents:
Introduction -- Basic Structure of High-Dimensional Spaces -- Algorithms -- Spaces with a Single Center -- Spaces with Multiple Clusters -- Representation by Graphs -- Using Models of High-Dimensional Spaces -- Including Contextual Information -- Conclusions -- Index -- References.
In: Springer eBooksSummary: High-dimensional spaces arise as a way of modelling datasets with many attributes. Such a dataset can be directly represented in a space spanned by its attributes, with each record represented as a point in the space with its position depending on its attribute values. Such spaces are not easy to work with because of their high dimensionality: our intuition about space is not reliable, and measures such as distance do not provide as clear information as we might expect. There are three main areas where complex high dimensionality and large datasets arise naturally: data collected by online retailers, preference sites, and social media sites, and customer relationship databases, where there are large but sparse records available for each individual; data derived from text and speech, where the attributes are words and so the corresponding datasets are wide, and sparse; and data collected for security, defense, law enforcement, and intelligence purposes, where the datasets are large and wide. Such datasets are usually understood either by finding the set of clusters they contain or by looking for the outliers, but these strategies conceal subtleties that are often ignored. In this book the author suggests new ways of thinking about high-dimensional spaces using two models: a skeleton that relates the clusters to one another; and boundaries in the empty space between clusters that provide new perspectives on outliers and on outlying regions. The book will be of value to practitioners, graduate students and researchers.
Tags from this library: No tags from this library for this title. Log in to add tags.
No physical items for this record

Introduction -- Basic Structure of High-Dimensional Spaces -- Algorithms -- Spaces with a Single Center -- Spaces with Multiple Clusters -- Representation by Graphs -- Using Models of High-Dimensional Spaces -- Including Contextual Information -- Conclusions -- Index -- References.

High-dimensional spaces arise as a way of modelling datasets with many attributes. Such a dataset can be directly represented in a space spanned by its attributes, with each record represented as a point in the space with its position depending on its attribute values. Such spaces are not easy to work with because of their high dimensionality: our intuition about space is not reliable, and measures such as distance do not provide as clear information as we might expect. There are three main areas where complex high dimensionality and large datasets arise naturally: data collected by online retailers, preference sites, and social media sites, and customer relationship databases, where there are large but sparse records available for each individual; data derived from text and speech, where the attributes are words and so the corresponding datasets are wide, and sparse; and data collected for security, defense, law enforcement, and intelligence purposes, where the datasets are large and wide. Such datasets are usually understood either by finding the set of clusters they contain or by looking for the outliers, but these strategies conceal subtleties that are often ignored. In this book the author suggests new ways of thinking about high-dimensional spaces using two models: a skeleton that relates the clusters to one another; and boundaries in the empty space between clusters that provide new perspectives on outliers and on outlying regions. The book will be of value to practitioners, graduate students and researchers.

There are no comments for this item.

Log in to your account to post a comment.

2017 | The Technical University of Kenya Library | +254(020) 2219929, 3341639, 3343672 | library@tukenya.ac.ke | Haile Selassie Avenue