¿Cómo interpretar la salida de regresión logística de weka?

Question

Oct 02, 2013, 01:49 PM

¿Cómo interpretar la salida de regresión logística de weka?

Por favor ayude a interpretar los resultados de la regresión logística producida por weka.classifiers.functions.Logistic de la biblioteca Weka.

Utilizo datos numéricos de los ejemplos de Weka:

@relation weather

@attribute outlook {sunny, overcast, rainy}
@attribute temperature real
@attribute humidity real
@attribute windy {TRUE, FALSE}
@attribute play {yes, no}

@data
sunny,85,85,FALSE,no
sunny,80,90,TRUE,no
overcast,83,86,FALSE,yes
rainy,70,96,FALSE,yes
rainy,68,80,FALSE,yes
rainy,65,70,TRUE,no
overcast,64,65,TRUE,yes
sunny,72,95,FALSE,no
sunny,69,70,FALSE,yes
rainy,75,80,FALSE,yes
sunny,75,70,TRUE,yes
overcast,72,90,TRUE,yes
overcast,81,75,FALSE,yes
rainy,71,91,TRUE,no

Para crear un modelo de regresión logística utilizo el comando: java -cp $ WEKA_INS / weka.jar weka.classifiers.functions.Logistic -t $ WEKA_INS / data / weather.numeric.arff -T $ WEKA_INS / data / weather.numeric.arff - d ./weather.numeric.model.arff

Aquí los tres argumentos significan:

-t <name of training file> : Sets training file.
-T <name of test file> : Sets test file. 
-d <name of output file> : Sets model output file.

Ejecutando el comando anterior producimos el siguiente resultado:

Logistic Regression with ridge parameter of 1.0E-8
Coefficients...
              Class
Variable                    yes
===============================
outlook=sunny           -6.4257
outlook=overcast        13.5922
outlook=rainy           -5.6562
temperature             -0.0776
humidity                -0.1556
windy                    3.7317
Intercept                22.234

Odds Ratios...
              Class
Variable                    yes
===============================
outlook=sunny            0.0016
outlook=overcast    799848.4264
outlook=rainy            0.0035
temperature              0.9254
humidity                 0.8559
windy                   41.7508


Time taken to build model: 0.05 seconds
Time taken to test model on training data: 0 seconds

=== Error on training data ===
Correctly Classified Instances          11               78.5714 %
Incorrectly Classified Instances         3               21.4286 %
Kappa statistic                          0.5532
Mean absolute error                      0.2066
Root mean squared error                  0.3273
Relative absolute error                 44.4963 %
Root relative squared error             68.2597 %
Total Number of Instances               14     

=== Confusion Matrix ===
 a b   <-- classified as
 7 2 | a = yes
 1 4 | b = no

Preguntas:

1) Primera sección del informe:

Coefficients...
              Class
Variable                    yes
===============================
outlook=sunny           -6.4257
outlook=overcast        13.5922
outlook=rainy           -5.6562
temperature             -0.0776
humidity                -0.1556
windy                    3.7317
Intercept                22.234

1.1) ¿Entiendo bien que los "coeficientes" son de hecho ponderaciones que se aplican a cada atributo antes de sumarlos para generar el valor de la clase "juego" igual a "sí"?

2) Segunda sección del informe:

Odds Ratios...
              Class
Variable                    yes
===============================
outlook=sunny            0.0016
outlook=overcast    799848.4264
outlook=rainy            0.0035
temperature              0.9254
humidity                 0.8559
windy                   41.7508

2.1) ¿Cuál es el significado de "Odds Ratios"? 2.2) ¿Todos ellos también se relacionan con el atributo de clase "juego" igual a "sí"? 2.3) ¿Por qué el valor de "outlook = overcast" es mucho más grande que el valor de "outlook = sunny"?

3)

=== Confusion Matrix ===
 a b   <-- classified as
 7 2 | a = yes
 1 4 | b = no

3.1) ¿Qué es la menaing de Matriz de Confusión?

¡Muchas gracias por tu ayuda!

Respuestas a la pregunta(1)

Preguntas populares

0 la respuesta

¿Por qué NSAssert se divide en main en lugar del código que llama a la aserción?

0 la respuesta

Cálculo del valor total en la columna JTable usando un nuevo valor

0 la respuesta

Forzando una descarga usando <filesMatch> en htaccess

0 la respuesta

Recorriendo las partes de una selección no contigua en MS * Word * VBA

0 la respuesta

¿Puedo agregar jarras a maven 2 build classpath sin instalarlas?

¡Eres muy activo! ¡Es genial!

¿Cómo interpretar la salida de regresión logística de weka?

Respuestas a la pregunta(1)

Su respuesta a la pregunta

Preguntas populares