SYLVA

SYLVA is the ecology database

The task of SYLVA is to classify forest cover types. The forest cover type for 30 x 30 meter cells is obtained from US Forest Service (USFS) Region 2 Resource Information System (RIS) data. We brought it back to a two-class classification problem (classifying Ponderosa pine vs. everything else). The “agnostic learning track” data consists in 216 input variables. Each pattern is composed of 4 records: 2 true records matching the target and 2 records picked at random. Thus ½ of the features are distracters. The “prior knowledge track” data is identical to the “agnostic learning track” data, except that the distracters are removed and the identity of the features is revealed. For that track, the forest cover original ids are revealed for training data.