Confidence Interval for Willingness to Pay

0 downloads 0 Views 1MB Size Report
confidence interval for the value of time in private transport was very unstable for .... 1 This was defined as per capita family income divided by available time. ..... 0.975 determine the lower and upper extremes of the 95% confidence interval.
Netw Spat Econ (2006) 6: 81–96 DOI 10.1007/s11067-006-7694-3

Confidence Interval for Willingness to Pay Measures in Mode Choice Models Raquel Espino & Juan de Dios Ortu´zar & Concepcio´n Roma´n

# Springer Science + Business Media, LLC 2006

Abstract We obtain confidence intervals for willingness-to-pay (WTP) measures derived from a mode choice model estimated to analyse travel demand for suburban trips in the two main interurban corridors in Gran Canaria using a mixed RP/SP data base. We considered a specification of the systematic utility that incorporates income effect, and interactions among socioeconomic variables and level-of-service attributes, as well as between travel cost and frequency. As our model provides rather complex expressions of the marginal utilities, we simulated the distribution of the WTP (in general, unknown) from a multivariate Normal distribution of the parameter vector. For every random draw of the parameter vector, the corresponding simulation of the WTP was obtained applying the sample enumeration method to the individuals in the RP database. The extremes of the confidence interval were determined by the percentiles on this distribution. After trying different simulation strategies we observed that the size of the intervals was strongly affected by the outliers as well as by the magnitude of the simulated parameters. In all cases examined the simulated distribution of the corresponding WTP measure presents an asymmetric shape that was very similar for the two model specifications considered. This is consistent with previous findings using a radically different approach. We also observed that the upper extreme of the confidence interval for the value of time in private transport was very unstable for different numbers of random draws. Keywords Confidence intervals . Willingness to pay . Mode choice

R. Espino (*) : C. Roma´n Department of Applied Economic Analysis, Universidad de Las Palmas de Gran Canaria Edificio Departamental de C.C. E.E. y E.E., mo´dulo D, Campus de Tafira, 35017 Las Palmas, Spain e-mail: {respino;croman}@daea.ulpgc.es J. de D. Ortu´zar Department of Transport Engineering, Pontificia Universidad Cato´lica de Chile, Casilla 306, co´digo 105, Santiago 22, Chile e-mail: [email protected]

82

Espino, Ortu´zar, and Roma´n

1. Introduction Most applied research concerning the value of travel time savings (VTTS) or other willingness to pay (WTP) measures is based on the formulation of simplified models that impose strong restrictions on the distribution of the random error terms (e.g., IIA property of the multinomial logit model). Also, the specification of the systematic utility does not consider the interaction of other variables with the main policy attributes. WTP measures are derived as the ratio of the marginal utility of the corresponding attribute and the marginal utility of income. In most cases (i.e., linear systematic utilities) these yield a fixed value equal to the time (or other attribute) parameter divided by the cost parameter of the utility function. However, as both the numerator and denominator in the WTP expression are obtained from point estimates of the corresponding parameters, the computed WTP measure is also a point estimate of the true value of the WTP. As maximum likelihood estimators in discrete choice models are asymptotically distributed multivariate Normal, the probability distribution of the WTP estimator is unknown. In fact, obtaining confidence intervals for the WTP estimates is not straightforward in most cases (Armstrong et al., 2001). In spite of the fact that obtaining confidence intervals (instead of point estimates) for the WTP measures may be relevant in the evaluation of transport projects, the research work done is this line is rather scarce (Garrido and Ortu´zar, 1993; Ettema et al., 1997); Armstrong et al. (2001) compare the results of applying different methods proposed in the literature to calculate confidence intervals for the value of time. They also derive the properties of two methods that are simple to apply in many circumstances: the asymptotic t-test and the likelihood ratio test. The first one allows the analyst to obtain an explicit expression for the extremes of the confidence interval in terms of the estimated parameters, their t-ratios and correlation coefficient. The main drawback of this method is the difficulty to obtain an explicit expression when the specification of utility is not linear in the attributes. On the other hand, the likelihood ratio test approach obtains the extremes of the confidence interval from the application of a search algorithm that requires the estimation of a restriction of the original model at every step. Armstrong et al. (2001) show that the results of these methods are similar to those obtained using a multivariate Normal simulation method proposed by Ettema et al. (1997). The latter can be considered the closest to reality as it obtains the simulated distribution of the WTP from simulations of the true distributions of the parameters involved in the WTP expression; it also seems to be more appropriate when the utility specification yields WTP expressions different from a simple ratio between two parameters. The availability of specialised software, as well as recent improvements in simulation procedures, has made it possible an easy implementation of the multivariate Normal simulation method. The aim of this paper is to provide confidence intervals for WTP measures obtained from the estimation of a mode choice model for suburban trips in the two main interurban corridors in Gran Canaria. The analysis is based on the use of a mixed revealed preference/stated preference (RP/SP) data base. The SP data were obtained from a choice experiment, between car and bus allowing for interactions among the main policy variables. A previous RP survey gathered information about travel decisions in the corridors and this information was used to adapt the SP choice experiment to each individual experience (Espino et al., 2004).

Confidence Interval for Willingness to Pay Measures in Mode Choice Models

83

As we detected the presence of income effect during the specification searches we included income in the utility specification by dividing travel costs by the expenditure rate.1 Also, significant interactions of various types were found and this yields WTP measures that are not constant across individuals. As an example, we found models in which the subjective value of time (SVT) for the bus mode was expressed in terms of the level of comfort, frequency, expenditure rate and if the person worked or not; for car alternatives, SVT was expressed in terms of the expenditure rate, gender and if the individual was employed. We also derived the WTP for increases in frequency, reductions in walking time and improvements in the level of comfort (Espino et al., 2006). As models implied rather complex expressions for the WTP measures, a MATLAB code was created in order to obtain the simulated distribution of the corresponding WTP from multivariate Normal simulations of the parameter vector. Confidence intervals were determined by the 0.025 and 0.975 percentile points. Different simulation strategies were used in order to reduce the amplitude of these intervals. The rest of the paper is organised as follows. Section 2 presents the theoretical background, the data base used in the study, the model specification and the derivation of the different WTP measures. The procedure followed to obtain the confidence intervals is presented in Section 3. Section 4 discuses the estimation and simulation results, and finally Section 5 summarises our main conclusions.

2. Theoretical Background The WTP for reductions in travel time or improvements in any other attribute determining travel demand is represented by the marginal rate of substitution (i.e., the trade-off) between travel cost and the corresponding attribute. WTP measures, also referred as the subjective value of a given attribute(SVk), are derived from the estimation of discrete choice models as the ratio between the marginal utility of the attribute (qkj) and the marginal utility of travel cost (cj); the latter coincides with minus the marginal utility of income (Jara-Dı´az, 1998): @Vj dcj @qkj WTP ¼ SV k ¼  ¼ @Vj dqkj @cj

ð1Þ

Here Vj is the conditional indirect utility of alternative j obtained under the assumption that the individual consumes continuous goods and chooses among a set of discrete alternatives (McFadden, 1981). When Vj is linear (1) yields the quotient between the coefficients of qkj and travel cost. The asymptotic t-test proposed by Armstrong et al. (2001) is based on the null hypothesis that the true value of time is equal to a given point estimate VT as shown in Eq. (2). t ¼ VT c t H 1 : 6¼ VT c

H0 :

1

This was defined as per capita family income divided by available time.

ð2Þ

84

Espino, Ortu´zar, and Roma´n

This null hypothesis can be transformed into the linear restriction: t  VTc ¼ 0. As the parameter estimators qt and q c distribute asymptotically Normal in discrete choice models, any linear combination of them also distributes Normal. Thus, the (1ja) confidence interval for the true value of time is represented by the set of values VT that satisfy the condition: z=2 

bt  VT bc  z=2 b t VT b c

ð3Þ

When utility is linear it is relatively easy to obtain an explicit expression of both the lower and upper extremes of the confidence interval (Armstrong et al., 2001). However, other specifications of the systematic utility, incorporating income effect and/or interactions between variables, yield more complex expressions of the marginal utilities and it may be difficult to obtain an explicit expression of the interval extremes by simply manipulating the inequalities in Eq. (3). The likelihood ratio test approach is based on the same hypothesis test but uses as test statistic the likelihood ratio test. Thus there are two models, the general or unrestricted one where utility is expressed as Viq = q ttiq + qcCiq + . . ., and the restricted model considering the linear restriction given by the null hypothesis in Eq. (2), where utility is expressed as Viq = q c( VT tiq + Ciq) + . . . . In this case the (1-a) confidence interval for the true value of time is given by the set of values VT that satisfy the condition: 2flðr =VT Þ  lðÞg  21;ð1Þ

ð4Þ

where l(qr / VT ) and l(q) are the log-likelihood values of the restricted and unrestricted models, respectively. Armstrong et al. (2001) develop a search algorithm to obtain the extremes of the confidence interval. This method requires the estimation of the restricted model at every step. As the parameter estimators distribute asymptotically Multivariate Normal, it is possible to obtain a simulated distribution of the WTP from random draws of this distribution and then obtain the confidence interval for the WTP measure as mentioned above. This method is appropriate when there are interactions between variables and, in general, for non-linear specifications of the utility function. 2.1. The Data The empirical application is based on previous work (Espino et al., 2006) that analysed mode choice decisions for suburban trips in the two main corridors in Gran Canaria. These corridors connect the capital city of the island (Las Palmas de Gran Canaria) with the cities of Arucas in the North and Telde in the South, covering a distance of around 40 km. The analysis is based on an RP/SP data bank; the RP survey gathered data about actual trip behaviour in the two corridors from a total of 710 respondents. Near 80% of these were car users (most of them car drivers) and 55% of the trips corresponded to commuters travelling for work or educational motives. The SP data were obtained from a choice experiment between car and bus, allowing for interactions between travel time, travel cost and frequency. The experiment also included parking costs for car and

Confidence Interval for Willingness to Pay Measures in Mode Choice Models

85

the latent variable comfort for bus. In order to gain realism, the levels assigned to the attributes in the SP exercise were customised using the information from the RP survey (Espino et al., 2004). 2.2. Model Specification The econometric model is based on random utility theory. It states that the utility of alternative j (this is the CIU of the preceding section) for individual q has the expression: Ujq ¼ Vjq þ ejq

ð5Þ

where Vjq is the representative or systematic utility and ejq is a random term that includes unobserved effects. Vjq depends on the observable attributes of alternative j and on socio-economic characteristics of individual q. Depending on the hypotheses made about the distribution of the random term we can obtain different discrete choice models (Ortu´zar and Willumsen, 2001). For example when ejq are distributed iid Gumbel we get the multinomial logit (MNL) model. On the other hand, the nested logit (NL) model is appropriate when the set of options faced by a decision-maker can be grouped into nests in such a way that the independence of irrelevant alternatives property of the MNL holds for alternatives within the same nest and does not hold for options belonging to different nests (Williams, 1977). The mixed RP/SP estimation methods is based on the hypothesis that the difference between the error terms in both data sets may be represented in terms of the differences between their variances, i.e., 2e ¼ 2 2 , where 2e and 2 are the variances of the RP and SP error terms. To mix the data we postulate the following utility functions for alternative j (Ben-Akiva and Morikawa, 1990): UjRP ¼ VjRP þ ej ¼ XjRP þ YjRP þ ej     UjSP ¼  VjSP þ j ¼  XjSP þ wZjSP þ j

ð6Þ

SP where q, , and w are parameters to be estimated; XRP are common j and Xj RP SP attributes to the RP and SP data sets; and Yj and Zj are attributes that only belong to the designated data set. Bradley and Daly (1997) proposed an estimation method based on building an artificial NL structure where RP alternatives are placed just below the root and each SP alternative is placed in a single-alternative nest with a common scale parameter m (see Ortu´zar and Willumsen, 2001 for details). We detected the presence of income effect using the procedure of Jara-Dı´az and Videla (1989). Therefore, income was specified in the utility function dividing both travel and parking costs by the expenditure rate (g); the latter is defined as per capita family income (PCFI) divided by the available time (i.e., 24 h minus the individual_s working hours). In our sample the share of income spent in transport does not exceed 5.77% for car drivers, 1.68% for car passengers and 2.45% for bus users. Following Jara-Dı´az (1998), this result implies that it is not required to include a cost squared term divided by the expenditure rate in the utility function. The models analysed have this specification.

86

Espino, Ortu´zar, and Roma´n

We also considered two alternative specifications for the latent variable Comfort which had taken three levels in the SP experiment: high, standard, and low. In the first case Comfort was included as a dummy variable, using the linear term qCLCL + qCHCH, where CL is equal to one when Comfort is low and zero otherwise; and CH is equal to one when Comfort is high and zero otherwise. In the second case, Comfort was included in the utility function interacting with Travel time through the non-linear term (q CLCL + q CHCH)*t. When Comfort does not interact with Travel time, the utility specifications for the RP (car-driver, car-passenger and bus) and the SP (car-driver and bus) alternatives are as follows:   RP RP VCar -Diver ¼ C D þ ðt þ t W W Þ  t þ c=g þ c=g S  S  c=g þ pc=g þ pc=g M  M pc=g   RP RP VCar-Pass ¼ C P þ ðt þ t W  W Þ  t þ c=g þ c=g S  S c=g   þ pc=g þ pc=gM  M  pc=g  RP VBus ¼ ðt þ t W  W Þ  t þ c=g þ c=g S  S  c=g   ð7Þ þðwt þ wt O  OÞ  wt þ f þ f A  A  f   SP SP VCar -Driver ¼ C D þ ðt þ t W  W Þ t þ c=g þ c=g S  S  c=g þ pc=g þ pc=g M  M  pc=g   SP VBus ¼ ðt þ t W  W Þ  t þ c=g þ c=g S  S þ f  c=g  f  c=g   þ f þ f A  A  f þ CL  CL þ CH  CH where t is travel time, wt is walking time, c is travel cost, pc is parking cost, and f the frequency of the bus. W, S, M, A and O are socioeconomic variables taking the value one for workers, men, mandatory trips, older than 35 and people who travel in the north corridor, respectively; and cero otherwise. On the other hand, when Comfort interacts with Travel time, the specification for the SP Bus option changes to that in expression (8): SP ¼ ðt þ t W  W þ t  CL  CL þ t  CH  CH Þ  t VBus þc=g þ c=g S S þ f  c=g  f  c=g þ f þ f A  A  f

ð8Þ

2.3. Derivation of the WTP Expressions WTP measures express changes in utility caused by changes in service attributes, in monetary terms, and are obtained from Eq. (1). On the other hand, monetary welfare measures like the compensating variation (CV), defined as the increase in income needed to bring back the individual to its initial utility level after an attribute change, can be obtained as:

CV ¼ Dqkj



@Vj @qkj ¼ Dqkj @Vj @cj

 WTP

ð9Þ

To interpret these measures it is necessary to pose some questions. Equations (1) and (9) are appropriate only if individuals do not choose a different alternative

Confidence Interval for Willingness to Pay Measures in Mode Choice Models

87

after qkj changes; this happens when infinitesimal changes in qkj are considered (see Lancsar and Savage, 2004a,b; Santos Silva, 2004; and Ryan, 2004). Discrete changes in the characteristics of an alternative may produce changes in the choice probabilities. Lancsar and Savage (2004a) suggest that to consider this uncertainty an accurate measure of welfare must be implicitly weighed by the probability of choosing the alternative. Santos Silva (2004) provides some insight in this respect considering a Taylor expansion of the classical result of Small and Rosen (1981) for the expected compensating variation (ECV). Although WTP figures should not be used directly to obtain the social benefits of a given project they are interesting instruments to measure welfare changes. Social benefits can be obtained from private benefit measures, as shown by Small and Rosen (1981) and Jara-Dı´az (1990) for the transport case. In a more recent work, Jara-Dı´az et al. (2000) provide a method to calculate social prices for traffic accident reductions following the social appraisal approach developed by Galvez and Jara-Dı´az (1998). The application of Eq. (1) to the hybrid utility2 obtained from Eqs. (7) and (8) yields the WTP measures for model 1 (comfort specified as a dummy variable) and model 2 (comfort interacting with travel time). Table 1 shows the analytical expression of these measures. As we observe, the specification of interactions as well as the expenditure rate dividing travel costs yield WTP that vary among individuals. As an example, we were able to find models in which the SVT for the bus mode was expressed in terms of the level of comfort, the frequency, the individual_s expenditure rate, and if s/he works or not. For car alternatives, SVT is expressed in terms of the expenditure rate, gender and if the individual works or not. In a similar fashion, we obtained WTP for increases in frequency, reductions in walking time and improvements in the level of comfort. The use of confidence estimates for WTP measures represents an important tool in the evaluation of transport investments. They provide the range of possible benefits derived from a given project. For instance, the confidence interval for the SVT allows planners to consider the lower and upper limits of the benefits obtained from travel time savings. Therefore, its use may be especially important when investment decisions are not straightforward and a thorough cost benefit analysis is needed. In these cases, point estimates of WTP measures provide only partial information. 3. Confidence Intervals for the WTP Measures The WTP expressions shown in Table 1 are far from those obtained when linear models are used (i.e., a single ratio between two parameters). To obtain confidence estimates for these WTP measures a sophisticated procedure based on simulation _of the WTP distribution is called for. The vector  of Maximum Likelihood estimators in a discrete choice model is asymptotically distributed Multivariate Normal N(q,@), where q is the true vector of parameters and @ is the asymptotic covariance matrix of the estimates; this depends 2 This utility is built from common and non-common RP-SP parameters (Louviere et al., 2000). If attributes were only defined for the SP case (i.e., Comfort) their parameters should be scaled by the SP scale factor m.

88

Espino, Ortu´zar, and Roma´n

Table 1 WTP expressions Model 1: Comfort as dummy

Model 2: Comfort interacting with travel time Subjective value of time

Car

Bus

Car

t þ t W W g c=g þ c=g S  S

t þ t W W g c=g þ c=g S  S þ f c=g  f

t þ t W W g c=g þ c=g S  S

Bus t þ t

W

 W þ   tCL  CL þ   tCH  CH g c=g þ c=g S  S þ c=g  f

Subjective value of walking time wt þ wt O  O g c=g þ c=g S  S þ f  c=g  f

Subjective value of frequency f þ f A  A þ f c=g  c=g g c=g þ c=g S  S þ f  c=g  f

WTP for improvements in comfort Low to standard

Low to standard

  CL g c=g þ c=g S  S þ f  c=g  f

  CL t g c=g þ c=g S  S þ f  c=g  f

Standard to high

Standard to high

  CH g c=g þ c=g S  S þ f  c=g  f

  CH t g c=g þ c=g S  S þ f  c=g  f

on q and on the vector of attributes. Since q is unknown, we generally use an estimated covariance matrix S, that is obtained by evaluating @ at the estimated parameters b, and the sample distribution of the attributes to estimate their true distribution (Ben-Akiva and Lerman, 1985). Hence, the asymptotic distribution is approached by N(b,S). A random draw q from the Multivariate Normal N(b,S) is obtained from q = b + Lh, where h is a vector of random draws from the standard Normal and L is the Cholesky factor of S, that is LL0 = S (see for example, Train, 2003). For every simulated vector of parameters we obtain a simulated value of the WTP distribution. As the WTP varies across individuals (see Table 1), the sample enumeration method (see Ortu´zar and Willumsen, 2001) is required to obtain a simulated observation of the WTP. This process is repeated a sufficiently large number of times in order to obtain the simulated distribution. Percentiles 0.025 and 0.975 determine the lower and upper extremes of the 95% confidence interval. During the simulation process we took two important aspects into account. First, as WTP expressions are represented by a quotient between linear combinations of parameters that distribute Normal, the numerator and denominator of the WTP expression also distribute Normal. Hence, a high value for the numerator or/and a value close to zero for the denominator may cause an

Confidence Interval for Willingness to Pay Measures in Mode Choice Models

89

extremely high simulated value for the WTP, moving towards the right3 the upper extreme of the confidence interval for the distribution. Second, a random draw of the parameter vector may produce inconsistent marginal utilities (i.e., with the wrong sign) for some individuals in the sample,4 causing counterintuitive interpretations of the WTP for these individuals. Leaving these observations in the sample during the sample enumeration process may bias the value obtained for the simulated WTP. Four different simulation strategies were used. The first one considered all individuals in the sample during the sample enumeration process, and included the outliers5 of the marginal simulated distributions. The second improved the method removing the outliers during the simulation of the parameter vector, thus it was possible to control some of the outliers of the WTP distribution produced by outliers of de numerator (of the WTP expression) distribution; the third also remove from the sample those individuals for which the simulated parameters provided marginal utilities that were inconsistent with microeconomic principles (that is, with the wrong sign); and finally a fourth strategy removed also the outliers of the simulated distribution of the WTP. In the last case we control not only the outliers produced by high values in the numerator but also those appeared by the obtaining of near-to-zero values in the denominator of the WTP.

4. Results of the Model 4.1. Estimation Results Table 2 shows the estimation results for two different specifications of the utility function. In model 1, Comfort is specified as a dummy variable, while in model 2 this latent variable interacts with Travel time. All parameter estimates have the expected sign (perhaps with the exception of the ASC for Car passenger). As the ASC for the SP Car alternative was not significantly different from zero it was removed from the final specification. Although the base parameters of Travel time and Parking cost/g are not significantly different from zero in both models of Table 2, this is explained by the inclusion of interaction terms with the socio-economic variables (Ortu´zar and Willumsen, 2001). Estimation results show, in general, that Travel time produces more disutility for workers than for non-workers; in fact, the former have less time available and in general exhibit a higher WTP for travel time savings. There are also differences in the perception of Cost between men and women. The parameter corresponding to the interaction term of Cost with Sex (qc/j_s) is positive, which means that the MUI is higher for women than for men. For mandatory trips (work and education), Parking costs produce more disutility than for other motives. Finally, for people older than 35 years of age improvements in the bus frequency are more valued than for the rest of the travellers.

3

This occurs when marginal utilities in the numerator and denominator present the same sign. Note that the WTP took a different value for every individual in the sample. 5 We consider outliers of the simulated distribution those values qk such that jk  bk j > 2k . 4

90

Espino, Ortu´zar, and Roma´n

Table 2 Estimation results Parameters(1) RP C D RP C P

RP CCar -Driver RP CCar -Pass

q q

Travel time (t) Travel time-Worker (t*W) Cost/g (c/g) Cost/g-Sex (c/g*S) Frequency*Cost /g (f*c/g) Parking cost/g (pc/g) Parking c /g-Mandatory (pc/g*M) Walking time (wt) Walking time-Origin (wt*O) Frequency ( f ) Frequency-Age ( f*A) Comfort low (CL) Comfort high (CH) Travel time-C.low (t*CL) Travel time-C.high (t*CH) Scale factor(2) r2 Log-likelihood N- of observations

qt qt w q c/g q c/g S q f c=g q pc=g q pc=g M q wt q wt 0 qf qf A q CL q CH q t.CL q t.CH m r2(c)  l b

Model 1

Model 2

3.321 (6.9) j0.871 (j2.7) j0.018 (j1.7) j0.0549 (j2.9) j0.3218 (j3.1) 0.2024 (2.6) j0.0479 (j2.2) j0.0370 (j0.4) j0.4391 (j2.5) j0.0790 (j2.0) j0.1432 (j2.6) 0.1020 (2.1) 0.1373 (2.3) j1.929 (j3.4) 0.5013 (1.5) – – 0.7225 (3.3)[1.28] 0.1279 j585.191 1,286

3.402 (7.1) j0.841 (j2.6) j0.0156 (j1.4) j0.0572 (j3.0) j0.3542 (j3.5) 0.2251 (2.8) j0.0505 (j2.2) j0.0175 (j0.2) j0.4471 (j2.4) j0.0784 (2.0) j0.1426 (j2.5) 0.0944 (1.8) 0.1606 (2.5) – – j0.0561 (j3.4) 0.0196 (1.8) 0.6185 (3.4)[2.11] 0.1247 j587.302 1,286

(1) Usual t-statistics; (2) t-statistics (m = 1).

Espino et al. (2006) obtained different WTP measures for each individual in the sample; aggregate WTP were computed using the sample enumeration method. In general it was found that the WTP for transport system improvements were higher for men than for women; this is consistent with the fact that the MUI is higher for women. On the other hand, the SVT was higher for workers than for non-workers, as well as for car users than for bus users. Improvements in frequency were more valued for people older than 35 years, and walking time savings were more valued for people who travelled in the Northern corridor; as walking conditions are significantly poorer there than in the Southern corridor that was an interesting finding. Finally, the WTP for walking time savings was 2.5 times higher than for travel time savings in the case of model 1. For model 2 the WTP for walking time savings was 2.5 times higher if Comfort was high, 1.9 times higher if Comfort was standard and only 1.5 times higher if Comfort was low. In the case of model 2, the values of travel time savings decreased as comfort improves. This is consistent with the fact that travel time produces more disutility as the level of comfort is reduced. 4.2. Simulation Results Confidence intervals for the WTP measures were obtained using a MATLAB code created for this purpose.6 The process time for the case of 50,000 simulations 6

We only obtained confidence intervals for the average WTP of each group.

Confidence Interval for Willingness to Pay Measures in Mode Choice Models

91

ranged from 90 min to 9 h approximately, depending on the simulation strategy. The size of the confidence interval also varied strongly with the strategy used (see Fig. 1) as expected. It is important to point out that in all the cases analysed, the narrowest intervals were obtained for the last simulation strategy i.e., removing the outliers of the WTP distribution as well as individuals with inconsistent marginal utilities in the sample enumeration process. Figure 1 also shows the evolution of the amplitude of the interval with the number of simulations. As can be seen, it does not vary much when we consider more than 10,000 random draws so this could be used to reduce the process time. The size of the intervals remain pretty stable for the third and fourth simulation

Fig. 1 Amplitude of the confidence intervals

92

Espino, Ortu´zar, and Roma´n

Table 3 Confidence intervals for the WTP measures WTP measure

Model 1: Comfort as dummy SV of time by car (pts./min) SV of time by bus (pts/min) SV of walking time (pts/min) SV of frequency (pts/bus per hour) WTP for improving comfort from low to standard (pts) WTP for improving comfort from standard to high (pts) Model 2: Comfort interacting with travel time SV of time by car (pts./min) SV of time by bus comfort low (pts/min) SV of time by bus comfort standard (pts/min) SV of time by bus comfort high (pts/min) SV of walking time (pts/min) SV of frequency (pts/bus per hour) WTP for improving comfort from low to standard (pts) WTP for improving comfort from standard to high (pts)

Estimate

Mean

Lower extreme

Upper extreme

48.25 27.95 68.22 j58.24 718.79

73.10 32.59 72.37 j67.26 764.32

25.41 17.08 28.60 j116.29 385.08

256.55 57.46 136.24 j34.11 1357.27

186.35

199.66

24.91

461.64

44.65 42.84 33.44 25.68 63.23 j58.24 602.32

77.13 53.94 35.22 28.62 69.21 j71.56 669.40

24.87 29.01 17.18 13.06 27.04 j126.62 337.48

262.37 94.86 63.90 52.35 129.92 j34.69 1205.41

209.65

228.94

40.15

506.92

strategies, with the exception of that obtained for the value of time for private transport. This fact could be explained by the high variance obtained for the simulated distribution. Concerning the model specification, model 1 presents narrower intervals for the value of time, the value of frequency and the WTP for improving the comfort of the bus from standard to high. However, these differences are not significantly high. Table 3 presents the lower and upper extremes of the confidence intervals for the WTP measures obtained with the optimal simulation strategy (i.e., the fourth) for models 1 and 2 in the case of 50,000 random draws. These values are compared with the mean of the simulated distribution7 as well as with the point estimates of the corresponding WTP measure obtained from models 1 and 2. In all cases the point estimate is located to the left of the mean of the distribution, with the only exception of the subjective value of the frequency. Note that the subjective value of this attribute is negative because the corresponding marginal utility is positive. Figure 2 shows the evolution of these confidence intervals with the number of simulations. We observe that the upper extreme presents a more unstable behaviour as we vary the number of simulations, especially in the case of the SVT by car. Once more in all cases the mean of the simulated distribution is located below the mid point of the interval, with the exception of frequency for the reasons stated before. It can also be observed that the high variability of the confidence interval for the value of time by car is explained by the variability of its 7

This is derived as the average of the simulated values for the WTP distribution.

Confidence Interval for Willingness to Pay Measures in Mode Choice Models

93

Fig. 2 Confidence intervals

upper extreme in the two models analysed. These results are consistent with those obtained using other methods in more simple models (Armstrong et al., 2001). Finally, Fig. 3 shows the histograms of the simulated distributions of the WTP measures. All distributions have the same asymmetric shape, with the exception of that for the frequency which presents a long tail in the right side. This explains the greater distance between the point estimate and the upper extreme of the interval. A simple observation reveals that the point estimate for the WTP measure derived from our models is pretty close to the mode (maximum frequency value) of the WTP distribution.

94

Espino, Ortu´zar, and Roma´n

Fig. 3 WTP simulated distribution

5. Conclusions Recent improvements in simulation techniques allow obtaining confidence intervals for benefit measures, such as subjective values or willingness-to-pay derived from the estimation of discrete choice models, even when their probability distribution8 is unknown. This approach overcomes the practical 8

This is generally obtained as a function of random variables, or estimators, with a known distribution.

Confidence Interval for Willingness to Pay Measures in Mode Choice Models

95

difficulties associated to methods which work well only in the case of models with linear utility functions (asymptotic t-test and likelihood ratio test). Non-linear utility specifications are becoming more prevalent in demand modelling as they provide detailed information about the WTP of different groups, helping to understand people’s decision making processes better. Confidence intervals for the WTP measures may represent a key element to evaluate the range of benefits in transport projects. They are especially useful when investment decisions are sensitive to individuals_ WTP for a new project, as they provide planners with more helpful information than the simply point estimates normally used in practice. In this paper we obtained confidence intervals for WTP measures derived from mode choice models estimated using a RP/SP data bank for suburban trips. These models incorporate interaction effects among policy variables and a latent variable such as comfort. Systematic heterogeneity in individual tastes is also considered through the specification of non-linear terms (i.e., interactions between some socio-economic variables and the modal attributes). This complexity in the utility specification yielded WTP measures that were not a simple ratio between two parameters and took a different value for every individual in the sample. Thus, to obtain the confidence intervals for the different WTP measures we had to recourse to the Multivariate Normal simulation method. Our results indicate that the size of the interval is affected by the outliers of the simulation as well as by the magnitude of the simulated parameters. As the model specification was based on a microeconomic analysis of consumer decisions the magnitude of the simulated parameters should be consistent with all the microeconomic principles derived from the analysis. In our case, this means that the marginal utilities of the different attributes must have a correct sign for every individual in the sample (i.e., before applying sample enumeration to obtain the corresponding WTP measure); otherwise, such individuals should be removed. The elimination of outliers in two steps (first, form the simulated Multivariate normal distribution and second, from the simulated distribution of the WTP), as well as the removal of individuals with inconsistent marginal utilities, was the simulation strategy that provided narrower confidence intervals. In this case the amplitude of the intervals remained constant as we increased the number of simulations. Finally, in all the cases analysed the simulated distribution of the various WTP measures presented a close asymmetric shape for the two model specifications studied.

Acknowledgments Part of this research was carried out during a research stay of the first author at the Institute of Transportation Studies at UC Berkeley. We are grateful to Professor Sergio Jara-Dı´az (University of Chile), Dr. Elisabetta Cherchi (University of Cagliari) and Dr. Staffan Algers (TRANSEK AB) for their comments which proved of valuable assistance. We also thank the comments of two anonymous referees who helped us end up with a better paper. This research was financed by Fundacio´n Universitaria (FULP) at the Universidad de Las Palmas de Gran Canaria and by the Direccio´n General de Universidades e Investigacio´n, of the Canary Islands Government. We are also grateful for the financial assistance of FONDECYT through Projects 1020981 and 1050672.

96

Espino, Ortu´zar, and Roma´n

References Armstrong P, Garrido R, Ortu´zar J de D (2001) Confidence intervals to bound the value of time. Transp Res E 37:143–161 Ben-Akiva M, Lerman SR (1985) Discrete choice analysis: theory and application to travel demand. MIT, Cambridge, Massachusetts Ben-Akiva ME, Morikawa T (1990, July) Estimation of travel demand models from multiple data sources. Proceedings 11th International Symposium on Transportation and Traffic Theory, Yokohama Bradley MA, Daly AJ (1997) Estimation of logit choice models using mixed stated preference and revealed preference information. In: Stopher PR, Lee-Gosselin M (eds) Understanding travel behaviour in an era of change, Pergamon, Oxford Espino R, Roma´n C, Ortu´zar J de D (2004) Disen˜o de preferencias declaradas para analizar la demanda de viajes. Estudios de Economı´a Aplicada 22:1–33 (in Spanish) Espino R, Roma´n C, Ortu´zar J de D (2006) Analysing demand for suburban trips: a mixed RP/SP model with latent variables and interaction effects. Transportation 33:241–261 Ettema D, Gunn H, De Jong G, Lindveld K (1997) A simulation method for determining the confidence interval of a weighted group average value of time. Proceedings 25th European Transport Forum, Brunel University, England Galvez T, Jara-Dı´az SR (1998) On the social valuation of travel time savings. Int J Transp Econ 25:205–219 Garrido, RA, Ortu´zar J de D (1993). The Chilean value of time study: methodological developments. Proceedings 21st PTRC Summer Annual Meeting, University of Manchester Institute of Science and Technology, England Jara-Dı´az SR (1990) Consumer_s surplus and the value of travel time savings. Transp Res 24:73– 77 Jara-Dı´az SR (1998) Time and income in travel choice: towards a microeconomic activity-based theoretical framework. In: Garling T, Laitila T, Westin K (eds) Theoretical foundations of travel choice modelling. Elsevier Science, New York Jara-Dı´az SR, Videla J (1989) Detection of income effect in mode choice: theory and application. Transp Res B 23:393–400 Jara-Dı´az SR, Ga´lvez T, Vergara C (2000) Social valuation of road accident reductions using subjective perceptions. J Transp Econ Policy 34:215–232 Lancsar E, Savage E (2004a) Deriving welfare measures in discrete choice experiments: inconsistency between current methods and random utility and welfare theory. Health Econ 13:901–907 Lancsar E, Savage E (2004b) Deriving welfare measures in discrete choice experiments: a response to Ryan and Santos Silva. Health Econ 13:919–924 Louviere JJ, Hensher DA, Swait JD (2000) Stated choice methods: analysis and application. Cambridge University Press, Cambridge McFadden D (1981) Econometric models of probabilistic choice. In: Manski C, McFadden D (eds) Structural analysis of discrete data: with econometric applications. MIT, Cambridge, Massachusetts Ortu´zar J de D, Willumsen LG (2001) Modelling Transport 3rd edition, Wiley, Chichester Ryan M (2004) Deriving welfare measures in discrete choice experiments: a comment to Lancsar and Savage (I). Health Econ 13:909–912 Santos Silva JMC (2004) Deriving welfare measures in discrete choice experiments: a comment to Lancsar and Savage (II). Health Econ 13:913–918 Small KA, Rosen HS (1981) Applied welfare economics with discrete choice models. Econometrica 49:105–130 Train KE (2003) Discrete choice methods with simulation. Cambridge University Press, Cambridge Williams HCWL (1977) On the formation of travel demand models and economic evaluation measures of user benefit. Environ Plan A9:167–219