Deep multi-survey classification of variable stars

C. Aguirre, K. Pichara, I. Becker

Research output: Contribution to journalScientific reviewResearchpeer-review

2 Citations (Scopus)

Abstract

© 2018 The Author(s). During the last decade, a considerable amount of effort has been made to classify variable stars using different machine-learning techniques. Typically, light curves are represented as vectors of statistical descriptors or features that are used to train various algorithms. These features demand high computational power and can last from hours to days, making it impossible to create scalable and efficient ways of classifying variable stars automatically. Also, light curves from different surveys cannot be integrated and analysed together when using features, because of observational differences. For example, having variations in cadence and filters, feature distributions become biased and require expensive data-calibration models. The vast amount of data that will be generated soon makes it necessary to develop scalable machine-learning architectures without expensive integration techniques. Convolutional neural networks have shown impressive results in raw image classification and representation within the machine-learning literature. In this work, we present a novel deep-learning model for light-curve classification, based mainly on convolutional units. Our architecture receives as input the differences between the time and magnitude of light curves. It captures the essential classification patterns, regardless of cadence and filter. In addition, we introduce a novel data augmentation schema for unevenly sampled time series. We tested our method using three different surveys: the Optical Gravitational Lensing Experiment (OGLE-III), the Visible and Infrared Survey Telescope (VISTA) and Convection, Rotation and planetary Transit (CoRot) which differ in filters, cadence and area of the sky. We show that, besides the benefit of scalability, our model obtains state-of-the-art level accuracy in light-curve classification benchmarks.
Original languageAmerican English
Pages (from-to)5078-5092
Number of pages4568
JournalMonthly Notices of the Royal Astronomical Society
DOIs
StatePublished - 1 Feb 2019
Externally publishedYes

Fingerprint

variable stars
light curve
machine learning
filter
filters
image classification
transit
classifying
learning
train
sky
convection
telescopes
time series
calibration
augmentation
experiment

Cite this

@article{834f901073f849b1892f056c99bbdf40,
title = "Deep multi-survey classification of variable stars",
abstract = "{\circledC} 2018 The Author(s). During the last decade, a considerable amount of effort has been made to classify variable stars using different machine-learning techniques. Typically, light curves are represented as vectors of statistical descriptors or features that are used to train various algorithms. These features demand high computational power and can last from hours to days, making it impossible to create scalable and efficient ways of classifying variable stars automatically. Also, light curves from different surveys cannot be integrated and analysed together when using features, because of observational differences. For example, having variations in cadence and filters, feature distributions become biased and require expensive data-calibration models. The vast amount of data that will be generated soon makes it necessary to develop scalable machine-learning architectures without expensive integration techniques. Convolutional neural networks have shown impressive results in raw image classification and representation within the machine-learning literature. In this work, we present a novel deep-learning model for light-curve classification, based mainly on convolutional units. Our architecture receives as input the differences between the time and magnitude of light curves. It captures the essential classification patterns, regardless of cadence and filter. In addition, we introduce a novel data augmentation schema for unevenly sampled time series. We tested our method using three different surveys: the Optical Gravitational Lensing Experiment (OGLE-III), the Visible and Infrared Survey Telescope (VISTA) and Convection, Rotation and planetary Transit (CoRot) which differ in filters, cadence and area of the sky. We show that, besides the benefit of scalability, our model obtains state-of-the-art level accuracy in light-curve classification benchmarks.",
author = "C. Aguirre and K. Pichara and I. Becker",
year = "2019",
month = "2",
day = "1",
doi = "10.1093/mnras/sty2836",
language = "American English",
pages = "5078--5092",
journal = "Monthly Notices of the Royal Astronomical Society",
issn = "0035-8711",
publisher = "Oxford University Press",

}

Deep multi-survey classification of variable stars. / Aguirre, C.; Pichara, K.; Becker, I.

In: Monthly Notices of the Royal Astronomical Society, 01.02.2019, p. 5078-5092.

Research output: Contribution to journalScientific reviewResearchpeer-review

TY - JOUR

T1 - Deep multi-survey classification of variable stars

AU - Aguirre, C.

AU - Pichara, K.

AU - Becker, I.

PY - 2019/2/1

Y1 - 2019/2/1

N2 - © 2018 The Author(s). During the last decade, a considerable amount of effort has been made to classify variable stars using different machine-learning techniques. Typically, light curves are represented as vectors of statistical descriptors or features that are used to train various algorithms. These features demand high computational power and can last from hours to days, making it impossible to create scalable and efficient ways of classifying variable stars automatically. Also, light curves from different surveys cannot be integrated and analysed together when using features, because of observational differences. For example, having variations in cadence and filters, feature distributions become biased and require expensive data-calibration models. The vast amount of data that will be generated soon makes it necessary to develop scalable machine-learning architectures without expensive integration techniques. Convolutional neural networks have shown impressive results in raw image classification and representation within the machine-learning literature. In this work, we present a novel deep-learning model for light-curve classification, based mainly on convolutional units. Our architecture receives as input the differences between the time and magnitude of light curves. It captures the essential classification patterns, regardless of cadence and filter. In addition, we introduce a novel data augmentation schema for unevenly sampled time series. We tested our method using three different surveys: the Optical Gravitational Lensing Experiment (OGLE-III), the Visible and Infrared Survey Telescope (VISTA) and Convection, Rotation and planetary Transit (CoRot) which differ in filters, cadence and area of the sky. We show that, besides the benefit of scalability, our model obtains state-of-the-art level accuracy in light-curve classification benchmarks.

AB - © 2018 The Author(s). During the last decade, a considerable amount of effort has been made to classify variable stars using different machine-learning techniques. Typically, light curves are represented as vectors of statistical descriptors or features that are used to train various algorithms. These features demand high computational power and can last from hours to days, making it impossible to create scalable and efficient ways of classifying variable stars automatically. Also, light curves from different surveys cannot be integrated and analysed together when using features, because of observational differences. For example, having variations in cadence and filters, feature distributions become biased and require expensive data-calibration models. The vast amount of data that will be generated soon makes it necessary to develop scalable machine-learning architectures without expensive integration techniques. Convolutional neural networks have shown impressive results in raw image classification and representation within the machine-learning literature. In this work, we present a novel deep-learning model for light-curve classification, based mainly on convolutional units. Our architecture receives as input the differences between the time and magnitude of light curves. It captures the essential classification patterns, regardless of cadence and filter. In addition, we introduce a novel data augmentation schema for unevenly sampled time series. We tested our method using three different surveys: the Optical Gravitational Lensing Experiment (OGLE-III), the Visible and Infrared Survey Telescope (VISTA) and Convection, Rotation and planetary Transit (CoRot) which differ in filters, cadence and area of the sky. We show that, besides the benefit of scalability, our model obtains state-of-the-art level accuracy in light-curve classification benchmarks.

UR - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85061084708&origin=inward

UR - https://www.scopus.com/inward/citedby.uri?partnerID=HzOxMe3b&scp=85061084708&origin=inward

U2 - 10.1093/mnras/sty2836

DO - 10.1093/mnras/sty2836

M3 - Scientific review

SP - 5078

EP - 5092

JO - Monthly Notices of the Royal Astronomical Society

JF - Monthly Notices of the Royal Astronomical Society

SN - 0035-8711

ER -