0% found this document useful (0 votes)
77 views47 pages

Data Collection and Analysis Methods

The document discusses different types of data sources, data collection methods, and data analysis methods. It covers existing sources, statistical studies, the internet as data sources. Quantitative methods like questionnaires and documents, and qualitative methods like interviews, observation, and focus groups are examined. Descriptive and inferential statistics are presented as methods for analyzing collected data.

Uploaded by

dtnquang2001
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
77 views47 pages

Data Collection and Analysis Methods

The document discusses different types of data sources, data collection methods, and data analysis methods. It covers existing sources, statistical studies, the internet as data sources. Quantitative methods like questionnaires and documents, and qualitative methods like interviews, observation, and focus groups are examined. Descriptive and inferential statistics are presented as methods for analyzing collected data.

Uploaded by

dtnquang2001
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd

Table of Contents

Part A..................................................................................................................................1

II. Data sources............................................................................................................1

Existing sources..............................................................................................................1

Statistical studies............................................................................................................2

The internet....................................................................................................................3

III. Data collection method............................................................................................3

Quantitative....................................................................................................................3

Qualitative.......................................................................................................................4

Data analysis methods...................................................................................................5

Part B..................................................................................................................................6

I. Summary for qualitative data................................................................................6

II. Summary for quantitative data...........................................................................13

III. Price and quantitative data..............................................................................21

IV. Price and qualitative data................................................................................30

Part C................................................................................................................................34

1. T-test.......................................................................................................................34

2. Regression model..................................................................................................37
Part A

II. Data sources


There are three main type of data: Existing sources, Statistical studies, The Internet
Existing sources
These are data collected by organizations from companies and business organizations.
Most of the information includes information about employees, customers or information
about business activities such as revenue, profit, sales. Some of the trusted organizations
that collect this information include government agencies, professional associations or
companies themselves.

Advantage Disadvantage

- Cheap - Rare detailed data


- Can be collected from many
sources
- Easy to collect
- No time-consuming

Statistical studies
Two main types of studies include: experimental and observational. Where
experimental research is a type of research where the researcher will base on a
definite variable and collect data on the factors affecting this definite variable in a
controlled environment. Studies are of the observational type, which is the study of
population variability. The researcher would observe spontaneously and not use
controls.

Advantage Disadvantage
Experimental study - Highest level of - Not likely to be
control accurate to real-life
- Provide specific situation
and relevant results - Difficult to
- Really good to replicate the result
identify problem, - Made by human so
cause, effect and there is high chance
connections be human flaw

Observational study - More accurate to - Difficult to identify


real-life situation problem, cause,
effect and
connections
- Researcher bias an
affect the final
result

The internet
With the rapid development of the Internet, this data source has become extremely
popular and popular. Today corporations and companies from small to large, both public
and limited companies have their own websites to publicize information. In addition,
government agencies also post information resources on their own websites.

Advantage Disadvangte

- Unlimited amount of public - Require knowledge to identify


information sources that have accuracy,
- Easy to access and time saving reliability and legality information
III. Data collection method
There are two main methods of data collection including qualitative and quantitative
methods. In which, the quantitative method is a method of collecting information that can
be expressed in numbers, and the quantitative method is to collect information through
words, words or non-numerical information. Using the right data collection method will
help avoid wasted time, optimize results, and drive direction. Ainsworth (2021)

Quantitative
a. Questionnarires and surveys methods

In these methods, the researcher creates his own, often closed-ended questions and
leads to observations and information gathering through answers. The main
advantage of this method is that it is cheap and saves time. Thanks to the ubiquity
of the internet, creating pre-sampled questionnaires and distributing them became
very easy, fast, and almost at no cost. So it is very convenient for surveys with a
large number of observations and to select observations at random. However, the
answers and scoring will have to be based on a certain scale, so it is difficult for
the respondents to give the most accurate feedback. The responses collected are
also often not so detailed and complete compared to methods such as interviews.
Finally, respondents often refuse to answer questionnaires like these because of
the time they spend and get nothing in return

b. Documents and records methods


A method by which researchers gather information from existing records and
data, mostly collected by other researchers. For example, financial records,
attendance records. The strength of this method is that most of the studies are
public so it's free, although collecting data from many different sources is quite
time-consuming, it still saves time and resources than surveying a sample
yourself. large, making it a very viable source of data for students or those new to
science. However, many studies are confidential and difficult to access, and
relying entirely on this data source also jeopardizes the ability to find
breakthroughs in science.

Qualitative
a. Interviews
The researchers gathered information through interviews by asking open-ended
questions. The strength of this approach is that it helps researchers dig into the
details and dig into certain areas of interest. However, the weakness of this method
requires the researcher to have soft skills because the interview is a conversation
between people. In addition, organizing an interview is also more expensive in
terms of both time and cash. To minimize these disadvantages, researchers
nowadays often use online interviews to save time and travel costs.
b. Observation
Observation is a form of information collection based solely on the researcher and
not through questions. The use of observation is only used where the use of other
methods of information collection is too difficult to measure and too complex.
Because the information collected through this method is very subjective because
the researcher will apply his own judgment to many observations. In a few cases
the probability of deviation is not too significant.
c. Focus group
This method is a combination of questioning, surveying and observation.
Researchers use this tool to collect data from a group of people who have
something in common. By collecting data from individual to a common problem.
This is a fairly comprehensive method as it helps the researcher to get a general
understanding of the problem and can ask questions or conduct interviews to dig
deeper. However, gathering a large group of people for interviews takes time and
money, which is the biggest weakness of this method.
Data analysis methods
Data analysis method was divided into two main types: Descriptive statistic and
Inferential statistic. This two methods are adopted and turn raw data into
meaningful and specify information into statistical knowledge.

Descriptive statistics Inferential statistics

Definition Descriptive Statistics are Inferential statistics


coefficients that includes many methods
succinctly describe or such as estimation,
summarize a given data making hypotheses and
set, which may be testing hypotheses,
representative of the analyzing correlations,
whole or a sample of a relationships between
population. Bickel & research objects, making
Lehmann, 2012). predictions, on the basis
of analysis. Analyze
sample data to find
insights and
characteristics of the
population. (CFI, n.d)

Application Arrange, analyze and Corelate, examine and


show raw data in a easy predict transformed
way to caculated information

Final result sample Charts, graphs, tables Proability

Both descriptive and inferential statistics methods have in common the same analysis of
data for the whole population. (Laerd, 2019) However descriptive statistics are used by
selecting a small group from the general data that the researcher wants to describe. Then
measure the data in this group and compare with the statistical overview. Inferential
statistics is the technique of analyzing a small sample in a review and then making
inferences and interpretations for the whole population. (Frost, 2018)

Part B
I. Summary for qualitative data

Type of the accommodation


Percen
Frequency t Valid Percent Cumulative Percent
Valid Entire home/apt 177 35.3 35.3 35.3
Private room 319 63.7 63.7 99.0
Shared room 5 1.0 1.0 100.0
Total 501 100.0 100.0
Total 501 accommodation private rooms account for the highest rate of 63.7% with a
total of 319 rooms. The type of accommodation with the second largest proportion is
Entire home/apartment with 177 rooms, accounting for 35.3%. Finally, there is a shared
room with 5 rooms accounting for 1%.

Whether the room is shared or not


Frequency Percent Valid Percent Cumulative Percent
Vali Not shared 496 99.0 99.0 99.0
d Shared 5 1.0 1.0 100.0
Total 501 100.0 100.0
Of the total 501 rooms, 496 rooms are not shared, accounting for 99%, the rest are 1%
shared rooms, accounting for 1%.

Whether the room is private or not


Percen
Frequency t Valid Percent Cumulative Percent
Vali Not private 182 36.3 36.3 36.3
d Private 319 63.7 63.7 100.0
Total 501 100.0 100.0
There are 182 non-private rooms out of 501 rooms, accounting for 36.3%. The remaining
63.7% are private rooms with 319 rooms.

Whether the host is a superhost or not


Frequenc
y Percent Valid Percent Cumulative Percent
Valid Not a superhost 422 84.2 84.2 84.2
Superhost 79 15.8 15.8 100.0
Total 501 100.0 100.0
According to the statistics table, there are 79 rooms with super hosts, accounting for
15.8%. The remaining 84.2% of the rooms without hosts are super hosts for a total of
422.

Whether the listing is for multiple rooms or not


Frequenc
y Percent Valid Percent Cumulative Percent
Vali Not multiple rooms 387 77.2 77.2 77.2
d Multiple rooms 114 22.8 22.8 100.0
Total 501 100.0 100.0
With the data whether the listing is for multiple rooms or not, there are a total of 501
rooms, of which 114 multiple rooms account for 22.8%. Not multiple room has 378
rooms, accounting for 77.2%.

Whether the listing is for business purposes or not


Frequenc Valid Cumulative
y Percent Percent Percent
Valid Not for business 253 50.5 50.5 50.5
purposes
For business purposes 248 49.5 49.5 100.0
Total 501 100.0 100.0
Based on the information on the table and chart, there are 248 rooms used for business
purposes, accounting for 49.5%. The remaining 253 rooms are not used for business
purposes and account for 50.5%.
II. Summary for quantitative data
Number of bedrooms

Std.
N Minimum Maximum Mean Deviation
Number of 501 0 4 1.12 .496
bedrooms
Valid N (listwise) 501
Frequency Percent Valid Percent Cumulative Percent
Valid 0 24 4.8 4.8 4.8
1 403 80.4 80.4 85.2
2 65 13.0 13.0 98.2
3 8 1.6 1.6 99.8
4 1 0.2 0.2 100.0
Total 501 100.0 100.0
The number of bedrooms data is quantitative, but since there are so few values and the
valids are all real and determinable, I treat it as a quantitative variable.
Out of the total of 501 rooms, there is only 1 room with 4 bedrooms, accounting for
0.2%. There are 403 rooms with 1 bedroom, the highest percentage is 80.4%. 13% of the
rooms have 2 bedrooms with a total of 65 rooms. 24 of these rooms have no bedrooms,
accounting for 4.8% of the total number of rooms. Finally, the number of rooms with 3
bedrooms, including 8 rooms, accounts for 1.6%.

Full price of accommodation for two people and two nights in EUR
Std.
N Minimum Maximum Mean Deviation
Full price of accommodation for 501 77.38 12886.24 386.1338 702.33537
two people and two nights in EUR
Valid N (listwise) 501

The average price of a room for two people for two nights in EUR is 386,133 euros. The
lowest price is 77.38 euros and the highest is 12886.24. The price range is very large with
the standard deviation up to 702.33. Looking at the chart, it can be seen that most room
rates fluctuate between 0-2500. The big price spikes account for a very small percentage
and are the outlines.

Maximum number of guests that can stay in the room


Mea Std.
N Minimum Maximum n Deviation
Maximum number of guests that can 501 2 6 2.81 1.223
stay in the room
Valid N (listwise) 501

Cleanliness rating
N Minimum Maximum Mean Std. Deviation
Cleanliness rating 501 2 10 9.03 1.177
Valid N (listwise) 501
Overall rating
N Minimum Maximum Mean Std. Deviation
Overall rating 501 20 100 88.69 10.813
Valid N (listwise) 501
The average rating of the overall rating is 88.69 with the lowest rating of 20 and the
highest rating of 100. The standard deviation of the overall rating is quite high at 10,813.
The reason for this high index can be seen in the chart when most ratings are in the range
of 60-100, but ratings outside this range can be considered as outliners.

Distance from city centre in km


N Minimum Maximum Mean Std. Deviation
Distance from city centre in km 501 0.476 10.226 4.64306 1.966065
Valid N (listwise) 501
The average distance to the city center is 4,643 km. With the shortest distance is 0.476
km and the highest is 10,226 km. The standard dviation index of 1,966 is quite low due to
the other indicators analyzed above. The line graph of the distance to the city center has a
bell shape ( nearly normal distribution).

Distance from nearest metro station in km


Std.
N Minimum Maximum Mean Deviation
Distance from nearest metro 501 0.014 4.245 0.65940 0.618365
station in km
Valid N (listwise) 501
The average distance of all rooms in the dataset to the nearest metro station is 0.65940
km. With the closest room to the metro station is 0.14 km, and the room farthest from the
metro station is 4.24 km. The standard deviation is 0.618 which is the lowest of all
analyzed data. However, the number of rooms is usually 2-2.5 km from the nearest metro
station. The rooms further away from this distance are quite small in number.

Attraction index
N Minimum Maximum Mean Std. Deviation
Attraction index 501 7.610 76.335 21.9449 9.488960
0
Valid N (listwise) 501
The average attraction index for this dataset is 21.94. With a minimum of 7,610 and a
maximum of 76,335. The standard deviation of 9,488 is quite high because most of the
attraction index is in the range of 0-50.

Restaurant index
N Minimum Maximum Mean Std. Deviation
Restaurant index 501 4.171 38.666 11.5161 4.489105
1
Valid N (listwise) 501
The average restaurant index reached 11,516. The lowest index of 4,171 is 34.5 points
less than the room with the highest restaurant index of 38,666 points. The standard
deviation is 4.48. Looking at the chart, it can be seen that most of the restaurant indexes
of the rooms fluctuate in the range of 0-25 points.
III. Price and quantitative data
1. Number of bed room
Full price of accommodation for
Number of two people and two nights in
bedrooms EUR
Number of Pearson 1 0.221**
bedrooms Correlation
Sig. (2-tailed) 0.000
N 501 501
Full price of Pearson 0.221** 1
accommodation for Correlation
two people and Sig. (2-tailed) 0.000
two nights in EUR N 501 501
**. Correlation is significant at the 0.01 level (2-tailed).
The pearson correlation between price and number of bedrooms rating is 0.221 which is
almost zero and the sig index of 0.00 is quite far from 0.5 (Statistical significance). This
proves that the correlation between these two variables is very low. However, the scatter plot
shows that a high number of bedroooms has no effect on price.

2. Cleanliness rating

Full price of
accommodat
ion for two
people and
two nights
in EUR Cleanliness rating
Full price of Pearson 1 -0.011
accommodation for Correlation
two people and two Sig. (2-tailed) 0.809
nights in EUR N 501 501
Cleanliness rating Pearson -0.011 1
Correlation
Sig. (2-tailed) 0.809
N 501 501

The pearson correlation between price and cleanliness rating is -0.11 which is almost zero
and the sig index of 0.809 is quite far from 0.5 (Statistical significance). This proves that
the correlation between these two variables is very low. However, the scatter plot shows
that a high cleanliness rating has a positive effect on room rates but not strong.
3. Overall rating
Full price of
accommodat
ion for two
people and
two nights Overall
in EUR rating
Full price of Pearson 1 0.011
accommodation for Correlation
two people and two Sig. (2-tailed) 0.814
nights in EUR N 501 501
Overall rating Pearson 0.011 1
Correlation
Sig. (2-tailed) 0.814
N 501 501
The correlation coefficient of the room rate for 2 people for two nights in Eu for the
overall rating of 0.011 is almost zero, and the sig coefficient 0.814 is quite far from 0.5
(Statistical significance). It can be concluded that the change in overall rating has a very
small impact on the price, the chart shows the same thing.

4. Distance from nearest metro station in km

Full price of
accommodat Distance
ion for two from nearest
people and metro
two nights station in
in EUR km
Full price of Pearson 1 -0.095*
accommodation for Correlation
two people and two Sig. (2-tailed) 0.034
nights in EUR N 501 501
Distance from nearest Pearson -0.095* 1
metro station in km Correlation
Sig. (2-tailed) 0.034
N 501 501
The correlation coefficient of -0.95 is pretty close to -1 between the two price variables
and the distance to the nearest metro station. This proves that the distance to the nearest
metro station has a strong negative impact on the price. The scatter plot does not see this
clearly because some of the variables are too high. But the value of sig is 0.034 which is
very low so it is not statistically significant

5. Attraction index

Full price of
accommodat
ion for two
people and
two nights Attraction
in EUR index
Full price of Pearson 1 0.182**
accommodation for Correlation
two people and two Sig. (2-tailed) 0.000
nights in EUR N 501 501
Attraction index Pearson 0.182** 1
Correlation
Sig. (2-tailed) 0.000
N 501 501

The correlation value of 0.182 is almost zero so it shows that the attraction index effect
has no effect on the price and the scatter plot also shows this. However, the sig value is 0,
so it is not statistically significant.
6. Distance from city centre in km

Full price of
accommodat
ion for two
people and Distance
two nights from city
in EUR centre in km
Full price of Pearson 1 -0.119**
accommodation for Correlation
two people and two Sig. (2-tailed) 0.008
nights in EUR N 501 501
Distance from city Pearson -0.119** 1
centre in km Correlation
Sig. (2-tailed) 0.008
N 501 501
The correlations between the price of the room and the distance to the city center is -
0.119 which is very low, which shows that there is almost no relationship between the
price and the distance to the city center. However, the sig coefficient of 0.008 is almost
zero, indicating no statistical value. Looking at the scatter plot we also see that there is no
effect between the distance to the city center and the room rate.
7. Restaurant index

Full price of
accommodat
ion for two
people and
two nights Restaurant
in EUR index
Full price of Pearson 1 0.154**
accommodation for Correlation
two people and two Sig. (2-tailed) 0.001
nights in EUR N 501 501
Restaurant index Pearson 0.154** 1
Correlation
Sig. (2-tailed) 0.001
N 501 501

Finally, the price and restaurant index with the pearson corelation value of 0.154, very
low, shows that there is almost no relationship between these two variables. However, a
Sig value of 0.001 shows no statistical value, the scatter plot shows no relationship
between the two variables.
IV. Price and qualitative data
Full price of
accommodat
ion for two
people and
two nights
in EUR
Type of the Entire Mean 609.82
accommodation home/apt Standard 605.55
Deviation
Private room Mean 265.32
Standard 727.63
Deviation
Shared room Mean 175.45
Standard 27.60
Deviation

The average price of entire home is 609.82 euros which is 400 euros higher than the
private room, and the shared room has the lowest average price of 175.45. So it can be
seen that the type of room has an effect on the price. The standard deviation of entire
home and private is very high at 605 and 727 respectively which means that the price of
this two type are much higher than the average price. Shared rooms with a standard
deviation of 27.6 have price near the average price. It can also be concluded that it is
more likely that the entire home and private room have multiple outliners and the price
fluctuation of this room type is very high.
Full price of
accommodat
ion for two
people and
two nights
in EUR
Whether the room is Not Mean 388.26
shared or not shared Standard 705.55
Deviation
Shared Mean 175.45
Standard 27.60
Deviation
Not shared room has an average price of 388.26 compared to the average price of 175 of
shared room can see whether the room is shared or not can affect the price. However, the
standard deviation of not shared room is very high, so the price fluctuation of this room
type is very high and it may be because outliners should be much higher than the average
price. The price of the shared room has a standard deviation of 27.6 which is closer to the
mean price.

Full price of
accommodat
ion for two
people and
two nights
in EUR
Whether the room is Not Mean 597.89
private or not private Standard 601.37
Deviation
Private Mean 265.32
Standard 727.63
Deviation

The average price of not private room is 597.89 which is higher than that of private room
with the average price of 265.32. So there is a chance a room is private or not affect the
price. However, the fluctuation range of both types is very high at 601.37 and 727.63
respectively, showing that the price volatility is very large, there is also a lot of possibility
of outliners.

Full price of accommodation for


two people and two nights in EUR
Whether the host is a Not a Mean 380.00
superhost or not superhost Standard 679.77
Deviation
Superhost Mean 418.90
Standard 816.46
Deviation

The average price of a room without a superhost of 380 is lower than the average price of
a superhost room of 418. There is a possibility of having a superhost or not affecting the
price. However, the standard deviations of these two variables are very high at 679.77
and 816.46, respectively. This means price swings are very large and there are likely to be
outliners.
Full price of
accommodat
ion for two
people and
two nights
in EUR
Whether the listing is Not multiple Mean 390.50
for multiple rooms or rooms Standard 779.48
not Deviation
Multiple rooms Mean 371.31
Standard 326.89
Deviation

Multiple room and not multiple room have average prices that are not too different at 390
and 326, respectively. So it is unlikely that the price will be affected. However, the
standard deviation of both variables is very large at 779.48 and 326.89, respectively, so it
can be seen that the price fluctuation is very large and most likely due to outliners.
Full price of
accommodat
ion for two
people and
two nights
in EUR
Whether the listing is Not for business Mean 423.17
for business purposes purposes Standard 944.88
or not Deviation
For business purposes Mean 348.35
Standard 291.24
Deviation

Non-business rooms have an average price of 432.17 more than business room rates by
an average of 348.35. The standard deviation of these two variables is very high at 944.88
and 291.24 respectively, indicating very large price volatility and possibly due to
outliners.

Part C
1. T-test

Group Statistics
Whether the listing is
for business purposes Std. Std. Error
or not N Mean Deviation Mean
Full price of Not for business 253 423.171 944.87984 59.40409
accommodation for purposes
two people and two For business purposes 248 348.349 291.24384 18.49400
nights in EUR
Independent Samples Test
Levene's
Test for
Equality
t-test for Equality of Means
of
Variance
s
95%
Sig. Std. Confidence
Mean
Sig (2- Error Interval of the
F t df Differen
. taile Differen Difference
ce
d) ce Low
Upper
er
Equal
varianc -
1.80 .18 1.1 0.23
Full price of es 499 74.822 62.73 48.4 198.07
3 0 9 4
accommodati assume 2
on for two d
people and Equal
two nights in varianc -
1.2 300.3
EUR es not .230 74.822 62.21 47.6 197.25
0 38
assume 1
d

The median non-business room rate of 423.71 was higher than the average business room
rate of 348.24. The standard deviations of both variables are very high at 944.87 and
291.24.
Because Sig.(F) =0.180 > α = 0.05
 Ho is accepted
 Equal variances assumed
 Application of T-test:
 H 0 : µ1 = µ 2

 H 1 : µ1 ≠ µ 2

 The Sig (2-tailed) = 0.000 < α = 0.05


 Application of T-test:
 H 0 : µ1 = µ 2

 H 1 : µ1 ≠ µ 2

 The Sig (2-tailed) = 0.000 < α = 0.05


Using the F-test, with F value is 1.803
H0: µ1( = µ2
H1: µ1 ≠ µ2
The Sig (2-tailed) = 0.234 > α = 0.05
H0 is accepted
µ1 = µ2
The average price of bussiness room and not for business room are similar. So whether
the listing is for business purposes or not does effect to the price.
Group Statistics
Whether the room is Std. Std. Error
private or not N Mean Deviation Mean
Full price of Not private 182 597.887 601.37118 44.57660
accommodation for Private 319 265.321 727.62745 40.73932
two people and two
nights in EUR
Independent Samples Test
Levene's
Test for
Equality of
Variances t-test for Equality of Means
95%
Confidence
Sig. Std. Error Interval of the
(2- Mean Differenc Difference
F Sig. t df tailed) Difference e Lower Upper
Full price of Equal 3.58 .059 5.23 499 0.000 332.56 63.58 207.63 457.50
accommodation variances 5
for two people assumed
and two nights Equal 5.5 436.3 .000 332.56 60.38 213.8 451.25
in EUR variances
not
assumed

The median private room rate of 265.321 was lower than the average price of not private
room with the rate of 597.887. The standard deviations of both variables are quite similar
40.73932 and 44.57660.
Because Sig.(F) =0.059 > α = 0.05
 Ho is accepted
 Equal variances assumed
 Application of T-test:
 H 0 : µ1 = µ 2

 H 1 : µ1 ≠ µ 2

 The Sig (2-tailed) = 0.000 < α = 0.05


 Application of T-test:
 H 0 : µ1 = µ 2

 H 1 : µ1 ≠ µ 2

 The Sig (2-tailed) = 0.000 < α = 0.05


For F-test, with F value is 3.585
H0: µ1 = µ2
H1: µ1 ≠ µ2
The Sig (2-tailed) = 0.000 < α = 0.05
H0 is rejected
µ1 ≠ µ2
The average price of not private room and private are different. So whether Whether the
room is private or not does not effect to the price.

2. Regression model

Model Summary
Mode R Adjusted R Std. Error of
l R Square Square the Estimate
1 0.316a 0.100 0.085 671.76140

R Square= 0.1, so 10% of the price affected by all the quantitative data below.

Coefficientsa
Unstandardized Standardized
Coefficients Coefficients
Model B Std. Error Beta t Sig.
1 (Constant) -1163.814 441.555 -2.636 0.009
Maximum number of 69.864 32.049 0.122 2.180 0.030
guests that can stay in
the room
Cleanliness rating -38.014 40.821 -0.064 -0.931 0.352
Overall rating 4.307 4.502 0.066 0.957 0.339
Number of bedrooms 191.143 78.159 0.135 2.446 0.015
Distance from city 106.258 38.940 0.297 2.729 0.007
centre in km
Distance from nearest -76.680 57.201 -0.068 -1.341 0.181
metro station in km
Attraction index 25.966 8.199 0.351 3.167 0.002
Restaurant index 7.636 15.900 0.049 0.480 0.631

 The estimated model:


realSum = B0 + B1* Maximum number of guests that can stay in the room + B2*
Cleanliness rating + B3* Overall rating + B4* Number of bedrooms + B5*
Distance from city centre in km + B6* Distance from nearest metro station in km
+ B7* Attraction index + B8* Restaurant index
With all independent variables are fixed
B0: is the regression constant, represents the price if other variables do not consider
B1: The maximum number of people in a room increases by 1, the average price of the
room increases by 69.86 euros

B2: Cleanliness rating increases by 1* the average room rate will decrease by 38,014
euros

B3: Overall rating increased by 1* average room rate increased by 4,307 euros

B4: The number of bedrooms in a residence increases by 1, the average room price
increases by 191,143 euros
B5: The distance from the city center increases by 1km, the average room price increases
by 106,258 euros

B6: The distance to the nearest metro increases by 1km, the average room price decreases
by 76,680 euros

B7: Attraction index increases by 1, the average room rate increases by 25,966 euros

B8: Restaurant index increases by 1, the average room rate increases by 7,636 euros

 Test for the significance of each independent variable


Hypothesis: H0: βn = 0, H1: βn ≠ 0 (n = 1->8)

If p-value < α (0.05) so H0 is unaccepted -> the variable is significant

The maximum number of people in a room: Sig = 0.030 < 0.05 => Unaccepted H0 =>
Significant

Cleanliness rating: Sig = 0.352 > 0.05 => Accepted H0 => Insignificant

Overall rating: Sig = 0.339 > 0.05 => Accepted H0 => Insignificant

The number of bedrooms: Sig = 0.015 < 0.05 => Unaccepted H0 => Significant

The distance from the city center: Sig = 0.007 < 0.05 => Unaccepted H0 => Significant

The distance to the nearest metro: Sig = 0.181 > 0.05 => Accepted H0 => Insignificant

Attraction index: Sig = 0.002 < 0.05 => Unaccepted H0 => Significant

Restaurant index: Sig = 0.631 > 0.05 => Accepted H0 => Insignificant

Best fit model:

Coefficientsa
Standardized
Unstandardized Coefficients Coefficients
Model B Std. Error Beta t Sig.
1 (Constant) -1034.529 324.138 -3.192 .002
Maximum number of guests 67.680 31.732 .118 2.133 .033
that can stay in the room
Number of bedrooms 191.087 77.243 .135 2.474 .014
Distance from city centre in 87.617 35.076 .245 2.498 .013
km
Attraction index 27.770 7.220 .375 3.846 .000
a. Dependent Variable: Full price of accommodation for two people and two nights in EUR

You might also like