LIBSVM Data: Classification (Binary Class)

This page contains many classification, regression, multi-label and string data sets stored in LIBSVM format. For some sets, raw materials (e.g., original texts) are also available. These data sets come from UCI, Statlog, StatLib and other collections; we thank them for their efforts. For most sets, we linearly scale each attribute to [-1,1] or [0,1]. The testing data (if provided) are adjusted accordingly. Some training data are further split into "training" (tr) and "validation" (val) sets. Details can be found in the description of each data set. To read the data in MATLAB, you can use "libsvmread" from the LIBSVM package.
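For readers not using MATLAB, the following is a minimal Python sketch of how a file in LIBSVM format could be parsed and how a linear scaling to [-1,1] might be applied. It assumes the usual sparse text layout, "label index:value index:value ...", with unlisted features treated as zero. The helper names (parse_libsvm_line, fit_ranges, scale) are hypothetical and are not part of the LIBSVM package, and the scaling shown is only an illustration of linear min/max scaling, not necessarily the exact procedure used to prepare the sets on this page.

```python
def parse_libsvm_line(line):
    """Parse one 'label index:value ...' line into (label, {index: value})."""
    tokens = line.split()
    label = float(tokens[0])
    features = {}
    for token in tokens[1:]:
        index, value = token.split(":", 1)
        features[int(index)] = float(value)
    return label, features


def fit_ranges(rows):
    """Compute (min, max) per feature index; unlisted features count as 0."""
    indices = sorted({i for _, feats in rows for i in feats})
    ranges = {}
    for i in indices:
        column = [feats.get(i, 0.0) for _, feats in rows]
        ranges[i] = (min(column), max(column))
    return ranges


def scale(value, lo, hi, lower=-1.0, upper=1.0):
    """Linearly map value from [lo, hi] to [lower, upper]."""
    if hi == lo:
        return value  # constant feature: left unchanged in this sketch
    return lower + (upper - lower) * (value - lo) / (hi - lo)


# Toy training rows (made up for illustration, not taken from any data set).
train = [parse_libsvm_line(s) for s in ("+1 1:0.5 3:2", "-1 1:1.5 2:4")]
ranges = fit_ranges(train)

# Scale a test value with the ranges learned from the training rows,
# so that training and testing data share the same transformation.
scaled = scale(1.0, *ranges[1])
```

Reusing the training ranges for the testing data is one way to read "adjusted accordingly"; the description of each data set remains the reference for what was actually done.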


a1a

a2a

a3a

a4a

a5a

a6a

a7a

a8a

a9a

australian

avazu

breast-cancer

cod-rna

colon-cancer

covtype.binary

criteo

criteo_tb

diabetes

duke breast-cancer

epsilon

fourclass

german.numer

gisette

heart

HIGGS

Hyperpartisan News Detection

ijcnn1

imdb-sentiment

ionosphere

kdd2010 (algebra)

kdd2010 (bridge to algebra)

kdd2010 raw version (bridge to algebra)

kdd2012

leukemia

liver-disorders

madelon

mushrooms

news20.binary

phishing

rcv1.binary

real-sim

skin_nonskin

splice

splice-site

sonar

SUSY

svmguide1

svmguide3

url

w1a

w2a

w3a

w4a

w5a

w6a

w7a

w8a

webspam