Оценка результатов построения деревьев решений при помощи регрессионного анализа

Возникновение и применения метода построения деревьев решений. Основные существующие алгоритмы и решаемые ими задачи. Существующие статистические методы, применяемые для решения тех же задач. Категориальная бинарная и небинарная целевая переменная.

Рубрика Экономико-математическое моделирование
Вид дипломная работа
Язык русский
Дата добавления 01.12.2019
Размер файла 591,6 K

Отправить свою хорошую работу в базу знаний просто. Используйте форму, расположенную ниже

Студенты, аспиранты, молодые ученые, использующие базу знаний в своей учебе и работе, будут вам очень благодарны.

39) age=35-44,25-34,45-54,55-64 1494 580 Happy (0.6117805 0.3882195)

78) Marital=Married 1334 492 Happy (0.6311844 0.3688156) *

79) Marital=Single 160 72 Unhappy (0.4500000 0.5500000)

158) Real_inc>=12613.96 74 30 Happy (0.5945946 0.4054054) *

159) Real_inc< 12613.96 86 28 Unhappy (0.3255814 0.6744186) *

5) Marital=No longer married 319 114 Unhappy (0.3573668 0.6426332) *

3) Health=Fair health,Bad health 4748 1755 Unhappy (0.3696293 0.6303707)

6) Marital=Married 3389 1444 Unhappy (0.4260844 0.5739156)

12) age=18-24,25-34 695 319 Happy (0.5410072 0.4589928)

24) status=Countryside,Small town 151 54 Happy (0.6423841 0.3576159) *

25) status=Town,Regional center 544 265 Happy (0.5128676 0.4871324)

50) Real_inc< 836.5103 62 21 Happy (0.6612903 0.3387097) *

51) Real_inc>=836.5103 482 238 Unhappy (0.4937759 0.5062241)

102) Real_inc>=19838.7 147 63 Happy (0.5714286 0.4285714) *

103) Real_inc< 19838.7 335 154 Unhappy (0.4597015 0.5402985)

206) Own_home=Do not own house 61 25 Happy (0.5901639 0.4098361) *

207) Own_home=Own house 274 118 Unhappy (0.4306569 0.5693431)

414) Real_inc< 7582.569 91 44 Happy (0.5164835 0.4835165)

828) Real_inc>=6061.862 30 10 Happy (0.6666667 0.3333333) *

829) Real_inc< 6061.862 61 27 Unhappy (0.4426230 0.5573770) *

415) Real_inc>=7582.569 183 71 Unhappy (0.3879781 0.6120219) *

13) age=35-44,45-54,55-64 2694 1068 Unhappy (0.3964365 0.6035635)

26) Real_inc>=19025 664 328 Unhappy (0.4939759 0.5060241)

52) status=Town,Small town 243 98 Happy (0.5967078 0.4032922) *

53) status=Countryside,Regional center 421 183 Unhappy (0.4346793 0.5653207) *

27) Real_inc< 19025 2030 740 Unhappy (0.3645320 0.6354680) *

7) Marital=Single,No longer married 1359 311 Unhappy (0.2288447 0.7711553) *

Спецификация дерева CHAID

Model formula:

Unhapp_k_3 ~ sex + status + age + educ + Rich + Poor + Rel_rich +

Rel_poor + Health + Obese + Overweight + Employment + Is_russian +

Marital + Has_children + Own_home + Savings

Fitted party:

[1] root

| [2] Health in Fair health

| | [3] Marital in Single: Unhappy (n = 441, err = 29.9%)

| | [4] Marital in Married

| | | [5] age in 35-44

| | | | [6] Has_children in No children: Unhappy (n = 58, err = 22.4%)

| | | | [7] Has_children in Has children

| | | | | [8] Is_russian in Other: Happy (n = 90, err = 37.8%)

| | | | | [9] Is_russian in Russian_nationality

| | | | | | [10] Rel_rich in Not relatively rich: Unhappy (n = 542, err = 41.7%)

| | | | | | [11] Rel_rich in Relatively_rich: Happy (n = 95, err = 41.1%)

| | | [12] age in 18-24, 25-34

| | | | [13] status in Town, Regional center: Happy (n = 488, err = 48.6%)

| | | | [14] status in Countryside, Small town: Happy (n = 143, err = 34.3%)

| | | [15] age in 45-54, 55-64

| | | | [16] Poor in Not poor

| | | | | [17] status in Town, Countryside, Small town

| | | | | | [18] Own_home in Do not own house: Unhappy (n = 21, err = 23.8%)

| | | | | | [19] Own_home in Own house: Unhappy (n = 651, err = 45.8%)

| | | | | [20] status in Regional center: Unhappy (n = 490, err = 34.9%)

| | | | [21] Poor in Poor: Unhappy (n = 333, err = 31.5%)

| | [22] Marital in No longer married

| | | [23] Has_children in No children: Unhappy (n = 61, err = 8.2%)

| | | [24] Has_children in Has children

| | | | [25] Rel_rich in Not relatively rich: Unhappy (n = 440, err = 20.0%)

| | | | [26] Rel_rich in Relatively_rich: Unhappy (n = 150, err = 30.7%)

| [27] Health in Bad health: Unhappy (n = 484, err = 25.0%)

| [28] Health in Good health

| | [29] Marital in Single

| | | [30] Is_russian in Other: Happy (n = 199, err = 16.6%)

| | | [31] Is_russian in Russian_nationality

| | | | [32] age in 35-44, 45-54: Unhappy (n = 79, err = 40.5%)

| | | | [33] age in 18-24, 55-64: Happy (n = 409, err = 34.0%)

| | | | [34] age in 25-34: Happy (n = 185, err = 47.6%)

| | [35] Marital in Married

| | | [36] Is_russian in Other: Happy (n = 391, err = 18.4%)

| | | [37] Is_russian in Russian_nationality

| | | | [38] educ in Middle, Low

| | | | | [39] age in 35-44, 45-54, 55-64

| | | | | | [40] Savings in No savings

| | | | | | | [41] status in Town, Countryside, Regional center: Happy (n = 604, err = 42.5%)

| | | | | | | [42] status in Small town: Happy (n = 51, err = 21.6%)

| | | | | | [43] Savings in Has savings: Happy (n = 92, err = 25.0%)

| | | | | [44] age in 18-24: Happy (n = 109, err = 16.5%)

| | | | | [45] age in 25-34

| | | | | | [46] Obese in Not obese: Happy (n = 462, err = 36.6%)

| | | | | | [47] Obese in Obese: Happy (n = 52, err = 15.4%)

| | | | [48] educ in Higher: Happy (n = 784, err = 24.6%)

| | [49] Marital in No longer married: Unhappy (n = 304, err = 35.2%)

Number of inner nodes: 21

Number of terminal nodes: 28

Спецификация дерева C50

Call:

C5.0.formula(formula = Unhapp_k_3 ~., data = train)

Class specified by attribute `outcome'

Read 8679 cases (19 attributes) from undefined.data

Decision tree:

Health = Good health:

:...Marital = No longer married: Unhappy (323.7/114.9)

: Marital in {Single,Married}:

: :...Is_russian = Other: Happy (626.6/109)

: Is_russian = Russian_nationality:

: :...Marital = Married: Happy (2283.6/725.3)

: Marital = Single:

: :...age in {35-44,45-54,55-64}: Unhappy (96.6/40.7)

: age = 18-24: Happy (418/140.6)

: age = 25-34:

: :...Rel_poor = Not relatively poor: Happy (156.9/66.4)

: Rel_poor = Relatively_poor: Unhappy (41/15)

Health in {Fair health,Bad health}:

:...Marital in {Single,No longer married}: Unhappy (1358.9/311.9)

Marital = Married:

:...age in {18-24,25-34}:

:...status = Small town: Happy (25.5/10.5)

: status = Countryside:

: :...Health = Fair health: Happy (123.5/41.5)

: : Health = Bad health: Unhappy (3)

: status = Town:

: :...Rich = Not rich:

: : :...age = 18-24: Happy (22/7)

: : : age = 25-34: Unhappy (147.4/58)

: : Rich = Rich:

: : :...age = 18-24: Unhappy (3)

: : age = 25-34: Happy (41.4/12.4)

: status = Regional center:

: :...Own_home = Do not own house: Happy (64/22)

: Own_home = Own house:

: :...educ = Higher: Happy (152.1/66.5)

: educ = Low: Unhappy (18/6)

: educ = Middle:

: :...Rel_poor = Not relatively poor: Unhappy (75/28)

: Rel_poor = Relatively_poor: Happy (22/6)

age in {35-44,45-54,55-64}:

:...Rel_rich = Relatively_rich:

:...Is_russian = Other: Happy (52/17)

: Is_russian = Russian_nationality:

: :...status in {Town,Small town}: Happy (122.6/52.1)

: status = Countryside: Unhappy (63/29)

: status = Regional center:

: :...age in {45-54,55-64}: Unhappy (109.3/39.3)

: age = 35-44:

: :...Health = Bad health: Unhappy (3)

: Health = Fair health:

: :...Has_children = No children: Unhappy (7/1)

: Has_children = Has children: Happy (48.5/20)

Rel_rich = Not relatively rich:

:...Has_children = No children: Unhappy (138.1/35)

Has_children = Has children:

:...Own_home = Do not own house: Unhappy (119.7/35)

Own_home = Own house:

:...Health = Bad health:

:...educ in {Middle,Low}: Unhappy (177.4/45.2)

: educ = Higher:

: :...Poor = Poor: Happy (7.1/1.1)

: Poor = Not poor:

: :...Savings = No savings: Unhappy (28.3/9.1)

: Savings = Has savings: Happy (6.2/1.2)

Health = Fair health:

:...Is_russian = Other:

:...age = 35-44: Happy (76.5/29.5)

: age in {45-54,55-64}: Unhappy (133.5/56)

Is_russian = Russian_nationality:

:...Savings = Has savings:

:...Overweight = Not overweight: Happy (124.3/56.8)

: Overweight = Overweight: Unhappy (75.8/25.4)

Savings = No savings:

:...Employment in {Disabled,Inactive,

: Unemployed}: Unhappy (160.1/53.1)

Employment = Retired:

:...educ = Low: Unhappy (43.5/14.5)

: educ = Middle: [S1]

: educ = Higher: [S2]

Employment = Employed:

:...age = 45-54: Unhappy (353.7/117.9)

age in {35-44,55-64}:

:...Real_inc <= 4368.257:

:...Obese = Obese: Happy (10.2)

: Obese = Not obese:

: :...sex = Male: Unhappy (9.8/1.8)

: sex = Female: Happy (21.9/6.9)

Real_inc > 4368.257:

:...Poor = Poor: Unhappy (53.5/15)

Poor = Not poor: [S3]

Спецификация дерева C4.5/J48

J48 pruned tree

------------------

Health = Fair health

| Marital = Single: Unhappy (432.0/128.0)

| Marital = Married

| | Rel_rich = Not relatively rich

| | | age = 35-44

| | | | Is_russian = Other: Happy (84.0/33.0)

| | | | Is_russian = Russian_nationality: Unhappy (589.0/236.0)

| | | age = 18-24: Happy (53.0/19.0)

| | | age = 25-34

| | | | status = Town: Unhappy (137.0/58.0)

| | | | status = Countryside: Happy (100.0/35.0)

| | | | status = Regional center

| | | | | sex = Male: Unhappy (94.0/44.0)

| | | | | sex = Female: Happy (104.0/44.0)

| | | | status = Small town: Happy (15.0/7.0)

| | | age = 45-54: Unhappy (588.0/208.0)

| | | age = 55-64: Unhappy (661.0/259.0)

| | Rel_rich = Relatively_rich

| | | status = Town: Happy (120.0/44.0)

| | | status = Countryside: Happy (67.0/29.0)

| | | status = Regional center: Unhappy (190.0/87.0)

| | | status = Small town: Happy (15.0/4.0)

| Marital = No longer married: Unhappy (646.0/139.0)

Health = Bad health: Unhappy (477.0/118.0)

Health = Good health

| Marital = Single

| | Is_russian = Other: Happy (191.0/33.0)

| | Is_russian = Russian_nationality

| | | age = 35-44: Unhappy (62.0/25.0)

| | | age = 18-24: Happy (387.0/131.0)

| | | age = 25-34

| | | | Real_inc <= 13951.216793: Unhappy (87.0/32.0)

| | | | Real_inc > 13951.216793: Happy (94.0/32.0)

| | | age = 45-54: Unhappy (16.0/7.0)

| | | age = 55-64: Unhappy (9.0/4.0)

| Marital = Married: Happy (2470.0/733.0)

| Marital = No longer married: Unhappy (298.0/103.0)

Number of Leaves : 26

Size of the tree: 38

Спецификация дерева LMT

Logistic model tree

------------------

Health = Fair health

| Marital = Single: LM_1:1/3 (432)

| Marital = Married

| | Rel_rich = Not relatively rich: LM_2:1/4 (2425)

| | Rel_rich = Relatively_rich

| | | Is_russian = Other: LM_3:1/5 (47)

| | | Is_russian = Russian_nationality: LM_4:1/5 (345)

| Marital = No longer married: LM_5:1/3 (646)

Health = Bad health: LM_6:1/2 (477)

Health = Good health: LM_7:1/2 (3614)

Number of Leaves : 7

Size of the Tree: 11

«Divergent drinking patterns and factors affecting homemade alcohol consumption (the case of Russia)»

Таблица 8

Спецификации регрессии

Samogon_consumption

Sex1

-0.974***

age

0.016**

College_ed1

-0.015

College_ed3

0.453**

Employment1

-0.861***

Ln_income

-0.023

South1

0.259

Vilage1

0.938***

Constant

-2.901***

N

3,025

Log Likelihood

-692.437

AIC

1,402.874

*p <.05; **p <.01; ***p <.001

AUC_binomial

0.7011606

cutoff Accuracy Sensitivity Specificity

1: 0.5 0.5441459 0.8219178 0.5232198

Спецификация дерева Rpart

n= 3124

node), split, n, loss, yval, (yprob)

* denotes terminal node

1) root 3124 223 0 (0.92861716 0.07138284) *

> AUC_rpart

[1] 0.5

> measures_rpart

cutoff Accuracy Sensitivity Specificity

1: 0.5 0.92994242 0 1

Спецификация дерева CHAID

Fitted party:

[1] root

| [2] Vilage in 0

| | [3] Employment in 0

| | | [4] Sex in 0

| | | | [5] South in 0: 0 (n = 341, err = 12.0%)

| | | | [6] South in 1

| | | | | [7] College_ed in 0, 1: 0 (n = 18, err = 5.6%)

| | | | | [8] College_ed in 3: 1 (n = 18, err = 38.9%)

| | | [9] Sex in 1: 0 (n = 432, err = 4.4%)

| | [10] Employment in 1

| | | [11] Sex in 0: 0 (n = 901, err = 4.8%)

| | | [12] Sex in 1: 0 (n = 743, err = 1.7%)

| [13] Vilage in 1

| | [14] Employment in 0

| | | [15] College_ed in 0, 1: 0 (n = 70, err = 10.0%)

| | | [16] College_ed in 3: 0 (n = 189, err = 27.5%)

| | [17] Employment in 1

| | | [18] College_ed in 0, 1

| | | | [19] Sex in 0: 0 (n = 91, err = 5.5%)

| | | | [20] Sex in 1: 0 (n = 73, err = 0.0%)

| | | [21] College_ed in 3: 0 (n = 241, err = 12.9%)

Number of inner nodes: 10

Number of terminal nodes: 11

AUC_ch

[1] 0.7171565

measures_ch

cutoff Accuracy Sensitivity Specificity

1: 1.00000000 0.92994242 0.0000000 1.00000000

2: 0.61111111 0.92418426 0.0000000 0.99380805

3: 0.27513228 0.89347409 0.2191781 0.94427245

4: 0.12863071 0.83109405 0.3561644 0.86687307

5: 0.12023460 0.75239923 0.5753425 0.76573787

6: 0.10000000 0.71785029 0.5890411 0.72755418

7: 0.05555556 0.71497121 0.5890411 0.72445820

8: 0.05494505 0.68714012 0.6301370 0.69143447

9: 0.04772475 0.44049904 0.8630137 0.40866873

10: 0.04398148 0.31094050 0.9589041 0.26212590

11: 0.01749664 0.08733205 0.9726027 0.02063983

12: 0.00000000 0.07005758 1.0000000 0.00000000

13: 0.00000000 0.07005758 1.0000000 0.00000000

Спецификация дерева C50

Read 3124 cases (8 attributes) from undefined.data

Decision tree:

0 (3124/223)

Evaluation on training data (3124 cases):

2901 (a): class 0

223 (b): class 1

> AUC_c50

[1] 0.5

measures50

cutoff Accuracy Sensitivity Specificity

1: 0.5 0.92994242 0 1

Спецификация дерева C4.5/J48

J48 pruned tree

------------------

: 0 (3025.0/217.0)

Number of Leaves : 1

Size of the tree: 1

> AUC_j48

[1] 0.5

> measures_j48

cutoff Accuracy Sensitivity Specificity

1: 0.5 0.92994242 0 1

Спецификация дерева LMT

Logistic model tree

------------------

: LM_1:1/1 (3025)

Number of Leaves : 1

Size of the Tree: 1

LM_1:

Class 0:

0.9 +

[Vilage=1] * -0.19

Class 1:

-0.9 +

[Vilage=1] * 0.19

> AUC_LMT

[1] 0.609787

> measures_lmt

cutoff Accuracy Sensitivity Specificity

1: 1.0000000 0.92994242 0.0000000 1.0000000

2: 0.1944205 0.75719770 0.4383562 0.7812178

3: 0.1427246 0.07005758 1.0000000 0.0000000

4: 0.0000000 0.07005758 1.0000000 0.0000000

«Низкооплачиваемые рабочие места на российском рынке труда: есть ли выход и куда он ведет?»

Таблица 9

Спецификации регрессии

TARGET

educHigher

-1.135***

educProfessional

-0.636***

Age35-44

-0.353**

Age45-54

-0.287*

Age55-64

0.402*

City_sizeCity

1.250***

City_sizeSmall_town_or_village

1.945***

City_sizeTown

1.649***

sectorAgriculture

1.188***

sectorCommercial_services

0.319**

sectorConstruction

-0.187

sectorNon-comercial_services

1.122***

Professional_statusBlue_collar_qual

0.184

Professional_statusBlue_collar_unqal

1.168***

Professional_statusWhite_collar_unqual

1.088***

Male

-0.951***

Married

0.101

Children_under_18

0.010

Full_day

0.646***

Constant

-3.208***

N

4,304

Log Likelihood

-2,014.644

AIC

4,069.287

*p <.05; **p <.01; ***p <.001

Спецификация дерева Rpart

n= 5675

node), split, n, loss, yval, (yprob)

* denotes terminal node

1) root 5675 1454 0 (0.7437885 0.2562115)

2) Professional_status=White_collar_qual,Blue_collar_qual 3983 704 0 (0.8232488 0.1767512)

4) sector=Industry,Commercial_services,Construction 2635 329 0 (0.8751423 0.1248577) *

5) sector=Agriculture,Non-comercial_services 1348 375 0 (0.7218101 0.2781899)

10) educ=Higher 704 116 0 (0.8352273 0.1647727) *

11) educ=Secondary or lower,Professional 644 259 0 (0.5978261 0.4021739)

22) City_size=Moskow_or_spb,City 169 40 0 (0.7633136 0.2366864) *

23) City_size=Small_town_or_village,Town 475 219 0 (0.5389474 0.4610526)

46) City_size=Town 168 67 0 (0.6011905 0.3988095)

92) Male>=0.5 40 10 0 (0.7500000 0.2500000) *

93) Male< 0.5 128 57 0 (0.5546875 0.4453125)

186) Age=35-44,45-54 75 28 0 (0.6266667 0.3733333) *

187) Age=24-34,55-64 53 24 1 (0.4528302 0.5471698) *

47) City_size=Small_town_or_village 307 152 0 (0.5048860 0.4951140)

94) Age=24-34,35-44,45-54 251 119 0 (0.5258964 0.4741036)

188) Age=24-34 63 26 0 (0.5873016 0.4126984) *

189) Age=35-44,45-54 188 93 0 (0.5053191 0.4946809)

378) Children_under_18< 0.5 77 34 0 (0.5584416 0.4415584) *

379) Children_under_18>=0.5 111 52 1 (0.4684685 0.5315315)

758) educ=Secondary or lower 55 25 0 (0.5454545 0.4545455) *

759) educ=Professional 56 22 1 (0.3928571 0.6071429) *

95) Age=55-64 56 23 1 (0.4107143 0.5892857) *

3) Professional_status=Blue_collar_unqal,White_collar_unqual 1692 750 0 (0.5567376 0.4432624)

6) City_size=Moskow_or_spb 170 22 0 (0.8705882 0.1294118) *

7) City_size=City,Small_town_or_village,Town 1522 728 0 (0.5216820 0.4783180)

14) Age=24-34,35-44,45-54 1311 580 0 (0.5575896 0.4424104)

28) sector=Industry,Commercial_services,Construction 942 368 0 (0.6093418 0.3906582)

56) City_size=City 356 105 0 (0.7050562 0.2949438) *

57) City_size=Small_town_or_village,Town 586 263 0 (0.5511945 0.4488055)

114) educ=Higher,Professional 261 90 0 (0.6551724 0.3448276) *

115) educ=Secondary or lower 325 152 1 (0.4676923 0.5323077)

230) Male>=0.5 105 39 0 (0.6285714 0.3714286) *

231) Male< 0.5 220 86 1 (0.3909091 0.6090909)

462) City_size=Town 113 50 1 (0.4424779 0.5575221)

924) Age=35-44 34 15 0 (0.5588235 0.4411765) *

925) Age=24-34,45-54 79 31 1 (0.3924051 0.6075949) *

463) City_size=Small_town_or_village 107 36 1 (0.3364486 0.6635514) *

29) sector=Agriculture,Non-comercial_services 369 157 1 (0.4254743 0.5745257)

58) Male>=0.5 149 67 0 (0.5503356 0.4496644)

116) Age=24-34,35-44 117 49 0 (0.5811966 0.4188034)

232) City_size=City,Town 65 22 0 (0.6615385 0.3384615) *

233) City_size=Small_town_or_village 52 25 1 (0.4807692 0.5192308) *

117) Age=45-54 32 14 1 (0.4375000 0.5625000) *

59) Male< 0.5 220 75 1 (0.3409091 0.6590909)

118) educ=Higher 32 12 0 (0.6250000 0.3750000) *

119) educ=Secondary or lower,Professional 188 55 1 (0.2925532 0.7074468) *

15) Age=55-64 211 63 1 (0.2985782 0.7014218) *

Спецификация дерева CHAID

Model formula:

TARGET ~ educ + Age + City_size + sector + Professional_status +

Male + Married + Children_under_18 + Full_day

Fitted party:

[1] root

| [2] Professional_status in White_collar_qual, Blue_collar_qual

| | [3] sector in Industry, Construction

| | | [4] educ in Secondary or lower: 0 (n = 437, err = 15.3%)

| | | [5] educ in Higher, Professional

| | | | [6] Professional_status in White_collar_qual, Blue_collar_unqal, White_collar_unqual: 0 (n = 354, err = 4.8%)

| | | | [7] Professional_status in Blue_collar_qual: 0 (n = 221, err = 10.0%)

| | [8] sector in Agriculture: 0 (n = 122, err = 41.0%)

| | [9] sector in Commercial_services

| | | [10] City_size in Moskow_or_spb: 0 (n = 131, err = 5.3%)

| | | [11] City_size in City: 0 (n = 372, err = 12.1%)

| | | [12] City_size in Small_town_or_village, Town: 0 (n = 490, err = 19.8%)

| | [13] sector in Non-comercial_services

| | | [14] educ in Secondary or lower: 1 (n = 114, err = 49.1%)

| | | [15] educ in Higher

| | | | [16] Age in 24-34: 0 (n = 119, err = 35.3%)

| | | | [17] Age in 35-44: 0 (n = 216, err = 14.8%)

| | | | [18] Age in 45-54, 55-64: 0 (n = 198, err = 7.1%)

| | | [19] educ in Professional: 0 (n = 303, err = 35.6%)

| [20] Professional_status in Blue_collar_unqal, White_collar_unqual

| | [21] City_size in Moskow_or_spb: 0 (n = 110, err = 14.5%)

| | [22] City_size in City: 0 (n = 377, err = 38.5%)

| | [23] City_size in Small_town_or_village: 1 (n = 394, err = 41.4%)

| | [24] City_size in Town: 0 (n = 346, err = 49.1%)

Number of inner nodes: 8

Number of terminal nodes: 16

Спецификация дерева C50

C5.0.formula(formula = TARGET ~., data = train)

Class specified by attribute `outcome'

Read 5675 cases (10 attributes) from undefined.data

Decision tree:

Professional_status in {White_collar_qual,Blue_collar_qual}:

:...sector in {Industry,Commercial_services,Construction}: 0 (2629.7/329)

: sector in {Agriculture,Non-comercial_services}:

: :...educ = Higher: 0 (704.8/116)

: educ in {Secondary or lower,Professional}:

: :...City_size in {Moskow_or_spb,City,Town}: 0 (339.4/107.3)

: City_size = Small_town_or_village:

: :...Age = 24-34: 0 (67/28.2)

: Age = 35-44:

: :...Children_under_18 <= 0: 0 (22.3/6.4)

: : Children_under_18 > 0: 1 (74.6/33.2)

: Age = 45-54:

: :...Married <= 0: 0 (32.2/11.4)

: : Married > 0: 1 (51.4/22.1)

: Age = 55-64:

: :...Married <= 0: 1 (18.3/3.5)

: Married > 0:

: :...Male <= 0: 0 (21.6/7)

: Male > 0: 1 (19.7/6.5)

Professional_status in {Blue_collar_unqal,White_collar_unqual}:

:...City_size = Moskow_or_spb: 0 (170.3/22)

City_size in {City,Small_town_or_village,Town}:

:...Age = 55-64: 1 (235.3/74.3)

Age in {24-34,35-44,45-54}:

:...educ in {Higher,Professional}: 0 (589.7/206.6)

educ = Secondary or lower:

:...Male > 0:

:...Children_under_18 <= 0: 1 (55.7/24.8)

: Children_under_18 > 0: 0 (185.8/59.6)

Male <= 0:

:...sector = Construction: 0 (1)

sector = Non-comercial_services: 1 (113.8/27.9)

sector in {Industry,Agriculture,Commercial_services}:

:...City_size = City: 0 (108.6/40.7)

City_size in {Small_town_or_village,

Town}: 1 (233.9/90.5)

Спецификация дерева C4.5/J48

J48 pruned tree

------------------

Professional_status = White_collar_qual: 0 (1932.0/358.0)

Professional_status = Blue_collar_qual: 0 (1145.0/201.0)

Professional_status = Blue_collar_unqal

| City_size = Moskow_or_spb: 0 (29.0/2.0)

| City_size = City: 0 (72.0/26.0)

| City_size = Small_town_or_village: 1 (93.0/37.0)

| City_size = Town: 1 (77.0/31.0)

Professional_status = White_collar_unqual

| City_size = Moskow_or_spb: 0 (81.0/14.0)

| City_size = City

| | Age = 24-34: 0 (94.0/38.0)

| | Age = 35-44: 0 (84.0/21.0)

| | Age = 45-54: 0 (72.0/25.0)

| | Age = 55-64: 1 (55.0/20.0)

| City_size = Small_town_or_village

| | educ = Secondary or lower: 1 (182.0/60.0)

| | educ = Higher: 0 (39.0/11.0)

| | educ = Professional: 1 (80.0/38.0)

| City_size = Town

| | Age = 24-34: 0 (70.0/32.0)

| | Age = 35-44: 0 (96.0/30.0)

| | Age = 45-54: 1 (70.0/34.0)

| | Age = 55-64: 1 (33.0/7.0)

Number of Leaves : 18

Size of the tree: 24

Спецификация дерева LMT

Logistic model tree

------------------

Professional_status = White_collar_qual: LM_1:1/2 (1932)

Professional_status = Blue_collar_qual

| City_size = Moskow_or_spb: LM_2:1/3 (92)

| City_size = City: LM_3:1/3 (333)

| City_size = Small_town_or_village: LM_4:1/3 (335)

| City_size = Town

| | Male <= 0: LM_5:1/4 (68)

| | Male > 0: LM_6:1/4 (317)

Professional_status = Blue_collar_unqal

| City_size = Moskow_or_spb: LM_7:1/3 (29)

| City_size = City

| | Male <= 0: LM_8:1/4 (44)

| | Male > 0: LM_9:1/4 (28)

| City_size = Small_town_or_village: LM_10:1/3 (93)

| City_size = Town: LM_11:1/3 (77)

Professional_status = White_collar_unqual

| City_size = Moskow_or_spb: LM_12:1/3 (81)

| City_size = City

| | Age = 24-34

| | | Male <= 0: LM_13:1/5 (69)

| | | Male > 0: LM_14:1/5 (25)

| | Age = 35-44: LM_15:1/4 (84)

| | Age = 45-54: LM_16:1/4 (72)

| | Age = 55-64: LM_17:1/4 (55)

| City_size = Small_town_or_village

| | educ = Secondary or lower: LM_18:1/4 (182)

| | educ = Higher: LM_19:1/4 (39)

| | educ = Professional: LM_20:1/4 (80)

| City_size = Town

| | Age = 24-34: LM_21:1/4 (70)

| | Age = 35-44: LM_22:1/4 (96)

| | Age = 45-54: LM_23:1/4 (70)

| | Age = 55-64: LM_24:1/4 (33)

Number of Leaves : 24

Size of the Tree: 34

Категориальная небинарная целевая переменная

«Дороги, ведущие молодежь в NEET: случай России»

Таблица 10

Спецификация регрессионной модели

Employed

NEET-inactive

NEET-unemployed

(1)

(2)

(3)

statusCity

0.804**

0.366

0.445

statusMoscow

0.917*

1.503**

1.448

statusTown or village

0.353

1.129**

-0.326

Health_statusGood health

0.179

0.439

0.269

Ln_inc

0.125***

0.092*

-0.135*

age15-19

-1.821***

-2.523***

-3.325***

educationCollege or higher

-0.759

-1.344**

-1.315

educationComplete middle

0.272

1.023*

1.164

educationComplete middle plus tech school

0.142

0.066

0.390

educationIncomplete middle plus tech school

0.992

1.681*

1.684

educationProfessional middle

2.030**

1.281

2.288*

marriedMarried

0.693

-0.248

0.309

Constant

-0.464

-1.055

-0.677

AIC

1,125.378

1,125.378

1,125.378

*p <.05; **p <.01; ***p <.001

Спецификация дерева Rpart

n= 678

node), split, n, loss, yval, (yprob)

* denotes terminal node

1) root 678 367 Students (0.45870206 0.34365782 0.15191740 0.04572271)

2) age=15-19 342 97 Students (0.71637427 0.18128655 0.08771930 0.01461988) *

3) age=20-24 336 165 Employed (0.19642857 0.50892857 0.21726190 0.07738095)

6) Ln_inc>=7.523116 236 88 Employed (0.12288136 0.62711864 0.22033898 0.02966102)

12) status=Large city,City,Moscow 157 50 Employed (0.15923567 0.68152866 0.12738854 0.03184713) *

13) status=Town or village 79 38 Employed (0.05063291 0.51898734 0.40506329 0.02531646)

26) education=College or higher,Professional middle 36 14 Employed (0.08333333 0.61111111 0.25000000 0.05555556) *

27) education=Incomplete middle,Complete middle,Complete middle plus tech school 43 20 NEET-inactive (0.02325581 0.44186047 0.53488372 0.00000000) *

7) Ln_inc< 7.523116 100 63 Students (0.37000000 0.23000000 0.21000000 0.19000000)

14) education=College or higher 52 23 Students (0.55769231 0.19230769 0.11538462 0.13461538) *

15) education=Incomplete middle,Complete middle,Complete middle plus tech school,Incomplete middle plus tech school,Professional middle 48 33 NEET-inactive (0.16666667 0.27083333 0.31250000 0.25000000) *

Спецификация дерева CHAID

Model formula:

Ocupation ~ status + Health_status + age + education + married

Fitted party:

[1] root

| [2] age in 20-24

| | [3] education in Incomplete middle, Complete middle, Complete middle plus tech school, Incomplete middle plus tech school, Professional middle

| | | [4] Health_status in Bad health: Employed (n = 33, err = 54.5%)

| | | [5] Health_status in Good health

| | | | [6] education in Incomplete middle, College or higher, Complete middle, Incomplete middle plus tech school: NEET-inactive (n = 50, err = 48.0%)

| | | | [7] education in Complete middle plus tech school, Professional middle

| | | | | [8] married in Not married: Employed (n = 80, err = 42.5%)

| | | | | [9] married in Married: Employed (n = 26, err = 11.5%)

| | [10] education in College or higher: Employed (n = 135, err = 54.8%)

| [11] age in 15-19

| | [12] education in Incomplete middle, College or higher, Complete middle plus tech school: Students (n = 218, err = 24.8%)

| | [13] education in Complete middle, Incomplete middle plus tech school: Students (n = 50, err = 52.0%)

| | [14] education in Professional middle: Employed (n = 3, err = 0.0%)

Number of inner nodes: 6

Number of terminal nodes: 8

Спецификация дерева C50

Call:

C5.0.formula(formula = Ocupation ~., data = train)

C5.0 [Release 2.07 GPL Edition]

-------------------------------

Class specified by attribute `outcome'

Read 678 cases (7 attributes) from undefined.data

Decision tree:

age = 15-19: Students (342/97)

age = 20-24:

:...Ln_inc > 7.467942: Employed (234.6/88.4)

Ln_inc <= 7.467942:

:...education in {Incomplete middle,

: Incomplete middle plus tech school}: NEET-inactive (8.1/3.1)

education in {College or higher,

: Complete middle plus tech school}: Students (66.5/33.5)

education in {Complete middle,

Professional middle}: Employed (26.9/16.6)

1: 0.6681416 0.3948913 0.8571129

Спецификация дерева C4.5/J48

J48 unpruned tree

------------------

age = 20-24

| Ln_inc <= 7.467942: Students (98.0/61.0)

| Ln_inc > 7.467942: Employed (218.0/83.0)

age = 15-19: Students (258.0/82.0)

Number of Leaves : 3

Size of the tree: 5

Спецификация дерева LMT

Logistic model tree

------------------

age = 20-24

| Ln_inc <= 7.467942: LM_1:1/3 (98)

| Ln_inc > 7.467942: LM_2:1/3 (218)

age = 15-19: LM_3:1/2 (258)

Number of Leaves : 3

Size of the Tree: 5

Доступность добровольного Медицинского страхования в России

Таблица 11

Спецификация регрессионной модели

Insurance paid by employer

Insurance self-paid

City_sizeCity or Town

-0.214

-0.295***

City_sizeSmall_town_or_village

-1.099***

-1.534***

MaritalDivorced or Widowed

-0.584**

0.441***

MaritalWas never married

-0.346***

0.182***

Male

-0.027

-0.388***

Is_russian

-0.326

-1.064***

v_age

0.017

0.210***

age_sq

-0.0004

-0.002***

Ln_inc

0.871***

-0.049

educHigher

0.503**

0.238***

educProfessional

-0.094

-0.602***

HealthBad

-0.230***

-0.858***

HealthGood

-0.182

0.147***

Children_under_18

0.204*

0.230

professionOfficials or Directors

1.704***

2.404***

professionQualified workers

1.924***

1.021***

professionSpecialist w/ higher ed

1.689***

2.188***

professionSpecialists w/ sec ed

1.738***

1.311***

professionUnqualified workers

1.485***

0.923***

Constant

-12.161***

-10.854***

AIC

2,208.950

2,208.950

*p <.05; **p <.01; ***p <.001

Спецификация дерева Rpart

n= 14196

node), split, n, loss, yval, (yprob)

* denotes terminal node

1) root 14196 2839.200000 Insurance self-paid (0.110000000 0.090000000 0.800000000)

2) v_age>=19.5 10727 2434.832000 Insurance self-paid (0.132916391 0.143046788 0.724036821)

4) v_age>=73.5 1005 11.650210 No insurance (0.906804564 0.093195436 0.000000000) *

5) v_age< 73.5 9722 2309.824000 Insurance self-paid (0.121794034 0.143763254 0.734442712)

10) profession=Qualified workers 1628 523.163900 Insurance self-paid (0.164524134 0.331124380 0.504351486)

20) v_age< 52.5 1307 139.604800 Insurance paid by employer (0.329962199 0.670037801 0.000000000)

40) Ln_inc< 8.800001 493 42.717450 No insurance (0.560732660 0.439267340 0.000000000) *

41) Ln_inc>=8.800001 814 85.075210 Insurance paid by employer (0.261090019 0.738909981 0.000000000) *

21) v_age>=52.5 321 100.070600 Insurance self-paid (0.053845038 0.104389190 0.841765772) *

11) profession=Unemployed,Officials or Directors,Specialist w/ higher ed,Specialists w/ sec ed,Unqualified workers 8094 1786.660000 Insurance self-paid (0.115892541 0.117886640 0.766220819)

22) City_size=Small_town_or_village 2564 380.217400 Insurance self-paid (0.314514522 0.102131308 0.583354170)

44) v_age>=54.5 872 0.000000 No insurance (1.000000000 0.000000000 0.000000000) *

45) v_age< 54.5 1692 281.566400 Insurance self-paid (0.231429995 0.114510171 0.654059833)

90) v_age< 34.5 671 42.717450 No insurance (0.636089604 0.363910396 0.000000000)

180) Ln_inc< 8.406469 418 7.766809 No insurance (0.858346665 0.141653335 0.000000000) *

181) Ln_inc>=8.406469 253 27.604190 Insurance paid by employer (0.441279929 0.558720071 0.000000000) *

91) v_age>=34.5 1021 164.181800 Insurance self-paid (0.163233869 0.072479465 0.764286666)

182) Ln_inc>=7.829703 711 129.337200 Insurance self-paid (0.257028166 0.164557888 0.578413945)

364) v_age< 46.5 445 38.834040 No insurance (0.558936779 0.441063221 0.000000000) *

365) v_age>=46.5 266 41.290780 Insurance self-paid (0.135505439 0.053260361 0.811234200) *

183) Ln_inc< 7.829703 310 34.844630 Insurance self-paid (0.089403755 0.000000000 0.910596245) *

23) City_size=Moskow_or_spb,City or Town 5530 1406.442000 Insurance self-paid (0.088959777 0.120023034 0.791017189)

46) v_age< 22.5 274 15.533620 No insurance (0.662893358 0.337106642 0.000000000) *

47) v_age>=22.5 5256 1360.363000 Insurance self-paid (0.085003020 0.118526437 0.796470543)

94) Marital=Married 3636 1005.127000 Insurance self-paid (0.097357351 0.152563854 0.750078795)

188) v_age< 27.5 283 30.658750 Insurance paid by employer (0.396827722 0.603172278 0.000000000) *

189) v_age>=27.5 3353 927.867900 Insurance self-paid (0.091491752 0.143737978 0.764770271)

378) Health=Bad 332 19.417020 No insurance (0.655794692 0.344205308 0.000000000) *

379) Health=Average,Good 3021 871.456700 Insurance self-paid (0.083304480 0.140829468 0.775866051)

758) profession=Unemployed,Specialists w/ sec ed 2103 516.237800 Insurance self-paid (0.144765904 0.181773068 0.673461028)

1516) v_age>=56.5 598 31.067230 No insurance (0.682388066 0.317611934 0.000000000) *

1517) v_age< 56.5 1505 418.422700 Insurance self-paid (0.109308589 0.172814208 0.717877203)

3034) Ln_inc>=8.93378 517 231.695500 Insurance self-paid (0.090452240 0.304531155 0.605016605)

6068) v_age>=39.5 263 27.038530 Insurance paid by employer (0.224870911 0.775129089 0.000000000) *

6069) v_age< 39.5 254 111.455200 Insurance self-paid (0.055795114 0.183197028 0.761007858) *

3035) Ln_inc< 8.93378 988 186.727300 Insurance self-paid (0.121646249 0.086632151 0.791721601)

6070) v_age< 34.5 355 19.417020 No insurance (0.670971535 0.329028465 0.000000000) *

6071) v_age>=34.5 633 127.714100 Insurance self-paid (0.082939507 0.069552341 0.847508152) *

759) profession=Officials or Directors,Specialist w/ higher ed,Unqualified workers 918 355.218900 Insurance self-paid (0.041189364 0.112773748 0.846036888) *

95) Marital=Divorced or Widowed,Was never married 1620 355.235500 Insurance self-paid (0.066338567 0.067104003 0.866557430)

190) Ln_inc< 9.716201 1201 218.364900 Insurance self-paid (0.143218101 0.092047106 0.764734794)

380) Health=Average 650 34.950640 No insurance (0.674781591 0.325218409 0.000000000)

760) profession=Unemployed,Officials or Directors 369 0.000000 No insurance (1.000000000 0.000000000 0.000000000) *

761) profession=Specialist w/ higher ed,Specialists w/ sec ed,Unqualified workers 281 30.771880 Insurance paid by employer (0.468209118 0.531790882 0.000000000) *

381) Health=Bad,Good 551 110.896700 Insurance self-paid (0.073611176 0.061513900 0.864874924) *

191) Ln_inc>=9.716201 419 136.870600 Insurance self-paid (0.025185077 0.053752000 0.921062923) *

3) v_age< 19.5 3469 404.368100 Insurance self-paid (0.072368648 0.002891068 0.924740283)

6) City_size=Moskow_or_spb,Small_town_or_village 1453 171.355600 Insurance self-paid (0.154532312 0.007336829 0.838130858)

12) v_age>=10.5 602 3.883404 No insurance (0.945970549 0.054029451 0.000000000) *

13) v_age< 10.5 851 99.479880 Insurance self-paid (0.096882116 0.003935631 0.899182253) *

7) City_size=City or Town 2016 233.012500 Insurance self-paid (0.052208332 0.001800222 0.945991446) *

Спецификация дерева CHAID

Model formula:

TARGET ~ City_size + Marital + Male + Is_russian + educ + Health +

profession

Fitted party:

[1] root

| [2] profession in Unemployed

| | [3] educ in Secondary or lower, Professional: No insurance (n = 4379, err = 0.3%)

| | [4] educ in Higher

| | | [5] Marital in Married

| | | | [6] City_size in Moskow_or_spb: No insurance (n = 112, err = 3.6%)

| | | | [7] City_size in City or Town, Small_town_or_village: No insurance (n = 392, err = 0.5%)

| | | [8] Marital in Divorced or Widowed, Was never married: No insurance (n = 370, err = 0.8%)

| [9] profession in Officials or Directors, Qualified workers, Specialist w/ higher ed, Specialists w/ sec ed

| | [10] City_size in Moskow_or_spb

| | | [11] educ in Secondary or lower, Professional: No insurance (n = 328, err = 5.8%)

| | | [12] educ in Higher: No insurance (n = 292, err = 15.1%)

| | [13] City_size in City or Town

| | | [14] educ in Secondary or lower, Professional

| | | | [15] Male in 0: No insurance (n = 1018, err = 3.5%)

| | | | [16] Male in 1: No insurance (n = 1164, err = 6.4%)

| | | [17] educ in Higher

| | | | [18] Is_russian in 0: No insurance (n = 141, err = 12.1%)

| | | | [19] Is_russian in 1: No insurance (n = 1335, err = 7.8%)

| | [20] City_size in Small_town_or_village: No insurance (n = 1490, err = 2.2%)

| [21] profession in Unqualified workers

| | [22] Is_russian in 0: No insurance (n = 50, err = 6.0%)

| | [23] Is_russian in 1: No insurance (n = 358, err = 1.4%)

Number of inner nodes: 10

Number of terminal nodes: 13

Спецификация дерева C50

Call:

C5.0.formula(formula = TARGET ~., data = train, control = C5.0Control(earlyStopping = T,

CF = 0.53, fuzzyThreshold = F, noGlobalPruning = T))

Class specified by attribute `outcome'

Read 14196 cases (12 attributes) from undefined.data

Decision tree:

profession = Unemployed: No insurance (6530.7/36.6)

profession in {Officials or Directors,Qualified workers,

: Specialist w/ higher ed,Specialists w/ sec ed,

: Unqualified workers}:

:...v_age <= 19: No insurance (1424.9/14.6)

v_age > 19:

:...Ln_inc <= 8.944289: No insurance (3084/79.3)

Ln_inc > 8.944289:

:...Ln_inc <= 11.05618: No insurance (3112.9/242.4)

Ln_inc > 11.05618:

:...Is_russian <= 0: Insurance paid by employer (4.1/0.1)

Is_russian > 0:

:...Health in {Bad,Good}: No insurance (18.6/5.1)

Health = Average: [S1]

Спецификация дерева C4.5/J48

J48 unpruned tree

------------------

Ln_inc <= 8.944289

| profession = Unemployed

| | Is_russian <= 0: No insurance (398.0/2.0)

| | Is_russian > 0

| | | Children_under_18 <= 0

| | | | Health = Average

| | | | | educ = Secondary or lower

| | | | | | City_size = Moskow_or_spb: No insurance (38.0)

| | | | | | City_size = City or Town: No insurance (225.0/1.0)

| | | | | | City_size = Small_town_or_village: No insurance (230.0/1.0)

| | | | | educ = Higher: No insurance (145.0/1.0)

| | | | | educ = Professional: No insurance (238.0)

| | | | Health = Bad: No insurance (462.0)

| | | | Health = Good: No insurance (117.0)

| | | Children_under_18 > 0

| | | | Health = Average: No insurance (203.0/3.0)

| | | | Health = Bad: No insurance (31.0)

| | | | Health = Good: No insurance (245.0)

| profession = Officials or Directors: No insurance (100.0/2.0)

| profession = Qualified workers

| | City_size = Moskow_or_spb: No insurance (24.0)

| | City_size = City or Town: No insurance (347.0/15.0)

| | City_size = Small_town_or_village: No insurance (231.0/4.0)

| profession = Specialist w/ higher ed: No insurance (341.0/11.0)

| profession = Specialists w/ sec ed

| | City_size = Moskow_or_spb: No insurance (73.0/4.0)

| | City_size = City or Town

| | | educ = Secondary or lower: No insurance (250.0/8.0)

| | | educ = Higher: No insurance (227.0/9.0)

| | | educ = Professional: No insurance (228.0/2.0)

| | City_size = Small_town_or_village

| | | v_age <= 40: No insurance (192.0/1.0)

| | | v_age > 40: No insurance (207.0)

| profession = Unqualified workers: No insurance (206.0/2.0)

Ln_inc > 8.944289

| v_age <= 59

| | profession = Unemployed: No insurance (173.0)

| | profession = Officials or Directors: No insurance (191.0/27.0)

| | profession = Qualified workers

| | | Health = Average: No insurance (279.0/28.0)

| | | Health = Bad: No insurance (19.0/1.0)

| | | Health = Good: No insurance (296.0/19.0)

| | profession = Specialist w/ higher ed: No insurance (369.0/43.0)

| | profession = Specialists w/ sec ed

| | | Health = Average

| | | | Ln_inc <= 9.36743: No insurance (208.0/12.0)

| | | | Ln_inc > 9.36743: No insurance (195.0/27.0)

| | | Health = Bad: No insurance (20.0)

| | | Health = Good: No insurance (320.0/30.0)

| | profession = Unqualified workers: No insurance (78.0/4.0)

| v_age > 59

| | Health = Average

| | | educ = Secondary or lower: No insurance (269.0/1.0)

| | | educ = Higher: No insurance (252.0/8.0)

| | | educ = Professional: No insurance (231.0/3.0)

| | Health = Bad: No insurance (437.0)

| | Health = Good: No insurance (98.0/4.0)

Number of Leaves : 39

Size of the tree: 58

Спецификация дерева LMT

Logistic model tree

------------------

: LM_1:14/14 (8193)

Number of Leaves : 1

Size of the Tree: 1

LM_1:

Class No insurance:

3.1 +

[City_size=Moskow_or_spb] * -0.15 +

[City_size=Small_town_or_village] * 0.46 +

[Is_russian] * 0.2 +

[age_sq] * 0 +

[Ln_inc] * -0.11 +

[educ=Higher] * -0.43 +

[educ=Professional] * 0.08 +

[Health=Average] * -0.08 +

[Children_under_18] * -0.15 +

[profession=Unemployed] * 1.07 +

[profession=Unqualified workers] * 0.15

Class Insurance paid by employer:

-6.92 +

[City_size=Small_town_or_village] * -0.49 +

[Marital=Married] * 0.25 +

[Marital=Divorced or Widowed] * -0.15 +

[age_sq] * -0 +

[Ln_inc] * 0.69 +

[Children_under_18] * 0.06 +

[profession=Unemployed] * -0.69 +

[profession=Qualified workers] * 0.1

Class Insurance self-paid:

-3.32 +

[City_size=Small_town_or_village] * -0.25 +

[Marital=Divorced or Widowed] * 0.23 +

[Male] * -0.26 +

[Is_russian] * -0.66 +

[v_age] * 0.02 +

[educ=Higher] * 0.18 +

[Health=Bad] * -0.35 +

[profession=Officials or Directors] * 0.72 +

[profession=Specialist w/ higher ed] * 0.5

Внутрифирменная мобильность на российском рынке труда

Таблица 11

Спецификация регрессионной модели

External

Horizontal

Mixed

Vertical

-1

-2

-3

-4

Male

0.208*

0.188

0.516

0.185

Agegroup30-39

-0.388***

0.039

-0.139

-0.731***

Agegroup40-49

-0.642***

-0.059

-0.728

-1.022***

Agegroup50 plus

-1.051***

-0.545

-1.134

-1.623***

educationHigher

-0.226***

-0.498*

-0.926

-0.325*

educationLess than secondary

-0.354**

-0.849*

-10.922

-0.896**

educationProfessional

-0.294***

-0.483**

-0.718

-0.431**

Married

-0.258**

-0.127

0.298

-0.071

City_sizeCity

0.317*

-0.300

-0.703

-0.153

City_sizeMoskow_or_spb

0.323

-0.081

0.001

-0.439

City_sizeTown

0.046

-0.452

-0.434

-0.283

Firm_size101-500

-0.278

1.121**

1.726**

0.090

Firm_size501-100

-0.448

0.658

1.111

-0.536

Firm_size51-100

-0.415*

0.464

1.269

0.029

Firm_sizeMore than 1000

-0.106

0.603

1.379*

0.295

OwnershipForeign

0.544

-7.597

0.063

-0.396

OwnershipPrivate

0.747***

-0.227

-0.953*

-0.081

Professional_statusBlue_collar_qual

-0.446*

-0.422

-0.266

0.210

Professional_statusWhite_collar_qual

-0.454*

-0.111

1.209

0.937

Professional_statusWhite_collar_unqual

-0.254

0.247

0.149

1.032

sectorCommercial_services

0.343

0.114

9.072

0.254

sectorConstruction

0.644*

0.136

8.502

-0.314

sectorIndustry

0.157

0.837

9.380

0.076

sectorNon-comercial_services

0.134

0.003

8.422

0.098

Subordinates1-49

-0.458**

-0.248

0.321

1.013***

Subordinates50 plus

-0.168

0.468

0.955

1.317**

educFull secondary

-0.440***

-1.422***

-0.812

-1.060***

educHigher

-0.226***

-0.498*

-0.926

-0.325*

educProfessional

-0.294***

-0.483**

-0.718

-0.431**

Constant

-1.314***

-3.251***

-13.378

-2.710***

AIC

6,126.268

6,126.268

6,126.268

6,126.268

*p <.05; **p <.01; ***p <.001

Спецификация дерева Rpart

n= 6024

node), split, n, loss, yval, (yprob)

* denotes terminal node

1) root 6024 3012.00000 Immobile (0.5 0.1 0.15 0.1 0.15)

2) Agegroup=50 plus 1510 420.52830 Immobile (0.66 0.071 0.12 0.062 0.08) *

3) Agegroup=Less than 30,30-39,40-49 4514 2591.47200 Immobile (0.46 0.11 0.16 0.11 0.17)

6) Subordinates=No subordinates 3613 1908.50700 Immobile (0.48 0.12 0.18 0.1 0.12)

12) Firm_size=Less than 50 1438 591.07070 Immobile (0.54 0.15 0.11 0.048 0.15) *

13) Firm_size=101-500,501-100,51-100,More than 1000 2175 1317.43700 Immobile (0.45 0.11 0.22 0.13 0.1)

26) sector=Commercial_services,Construction,Non-comercial_services 1517 840.02080 Immobile (0.47 0.12 0.16 0.13 0.12)

52) City_size=City,Town 1058 484.56930 Immobile (0.52 0.13 0.15 0.062 0.14) *

53) City_size=Small_town_or_village,Moskow_or_spb 459 355.45140 Immobile (0.38 0.099 0.18 0.24 0.098)

106) Profe...


Подобные документы

  • Разработка и принятие правильного решения как задачи работы управленческого персонала организации. Деревья решений - один из методов автоматического анализа данных, преимущества их использования и область применения. Построение деревьев классификации.

    контрольная работа [91,6 K], добавлен 08.09.2011

  • Этапы построения деревьев решений: правило разбиения, остановки и отсечения. Постановка задачи многошагового стохастического выбора в предметной области. Оценка вероятности реализации успешной и неуспешной деятельности в задаче, ее оптимальный путь.

    реферат [188,8 K], добавлен 23.05.2015

  • Использование системного анализа для подготовки и обоснования управленческих решений по многофакторным проблемам. Возникновение синергетики как науки о законах построения организации, возникновения упорядоченности, развитии и самоусложнении системы.

    реферат [40,4 K], добавлен 21.01.2015

  • Понятие сетевого планирования, его особенности, назначение и сферы применения. Правила и этапы построения сетевых графиков, необходимые расчеты и решение типовых задач. Общая характеристика корреляционного и регрессивного анализа, их применение.

    контрольная работа [142,3 K], добавлен 29.04.2009

  • Геометрическая интерпретация, графический и симплексный методы решения задачи линейного программирования. Компьютерная реализация задач стандартными офисными средствами, в среде пакета Excel. Задачи распределительного типа, решаемые в землеустройстве.

    методичка [574,3 K], добавлен 03.10.2012

  • Понятие математического программирования как отрасли математики, являющейся теоретической основой решения задач о нахождении оптимальных решений. Основные этапы нахождения оптимальных решений экономических задач. Примеры задач линейного программирования.

    учебное пособие [2,0 M], добавлен 15.06.2015

  • Теоретические основы прикладного регрессионного анализа. Проверка предпосылок и предположений регрессионного анализа. Обнаружение выбросов в выборке. Рекомендации по устранению мультиколлинеарности. Пример практического применения регрессионного анализа.

    курсовая работа [1,2 M], добавлен 04.02.2011

  • Использование симплексного метода решения задач линейного программирования для расчета суточного объема производства продукции. Проверка плана на оптимальность. Пересчет симплексной таблицы методом Жордана-Гаусса. Составление модели транспортной задачи.

    контрольная работа [613,3 K], добавлен 18.02.2014

  • Математическая формализация оптимизационной проблемы. Геометрическая интерпретация стандартной задачи линейного программирования, планирование товарооборота. Сущность и алгоритм симплекс-метода. Постановка транспортной задачи, последовательность решения.

    учебное пособие [126,0 K], добавлен 07.10.2014

  • Применение линейного программирования для решения транспортной задачи. Свойство системы ограничений, опорное решение задачи. Методы построения начального опорного решения. Распределительный метод, алгоритм решения транспортной задачи методом потенциалов.

    реферат [4,1 M], добавлен 09.03.2011

  • Изучение интуитивных и рациональных методов подхода к решению творческих задач. Темпы технического прогресса напрямую зависят от изобретателей, а экономические успехи зависят от темпов технического прогресса. Методы решения изобретательских задач.

    реферат [22,4 K], добавлен 17.07.2008

  • Сущность метода наименьших квадратов. Экономический смысл параметров кривой роста (линейная модель). Оценка погрешности и проверка адекватности модели. Построение точечного и интервального прогноза. Суть графического построения области допустимых решений.

    контрольная работа [32,3 K], добавлен 23.04.2013

  • Количественное обоснование управленческих решений по улучшению состояния экономических процессов методом математических моделей. Анализ оптимального решения задачи линейного программирования на чувствительность. Понятие многопараметрической оптимизации.

    курсовая работа [4,2 M], добавлен 20.04.2015

  • Навыки применения теоретических знаний по теме "Одномерный регрессионный анализ" при решении экономических задач с помощью системы GRETL. Анализ затрат в зависимости от числа ящиков, готовых к разгрузке. Обоснование результатов регрессионного анализа

    лабораторная работа [27,2 K], добавлен 15.12.2008

  • Решение задач при помощи пакета прикладных программ MatLab. Загрузка в MatLab матриц A и P. Нахождение оптимальной стратегии для заданных матриц с использованием критериев принятия решений в условиях неопределённости Вальда, Гурвица, Лапласа, Сэвиджа.

    лабораторная работа [80,2 K], добавлен 18.03.2015

  • Классическая теория оптимизации. Функция скаляризации Чебышева. Критерий Парето-оптимальность. Марковские процессы принятия решений. Метод изменения ограничений. Алгоритм нахождения кратчайшего пути. Процесс построения минимального остовного дерева сети.

    контрольная работа [182,8 K], добавлен 18.01.2015

  • Связь между случайными переменными и оценка её тесноты как основная задача корреляционного анализа. Регрессионный анализ, расчет параметров уравнения линейной парной регрессии. Оценка статистической надежности результатов регрессионного моделирования.

    контрольная работа [50,4 K], добавлен 07.06.2011

  • Построение экономических и математических моделей принятия решений в условиях неопределенности. Общая методология оптимизационных задач, оценка преимуществ выбранного варианта. Двойственность и симплексный метод решения задач линейного программирования.

    курс лекций [496,2 K], добавлен 17.11.2011

  • Основные понятия линейной алгебры и выпуклого анализа, применяемые в теории математического программирования. Характеристика графических методов решения задачи линейного программирования, сущность их геометрической интерпретации и основные этапы.

    курсовая работа [609,5 K], добавлен 17.02.2010

  • Классические подходы к анализу финансовых рынков, алгоритмы машинного обучения. Модель ансамблей классификационных деревьев для прогнозирования динамики финансовых временных рядов. Выбор алгоритма для анализа данных. Практическая реализация модели.

    дипломная работа [1,5 M], добавлен 21.09.2016

Работы в архивах красиво оформлены согласно требованиям ВУЗов и содержат рисунки, диаграммы, формулы и т.д.
PPT, PPTX и PDF-файлы представлены только в архивах.
Рекомендуем скачать работу.