The Problem of Sample Representativeness for Conducting Experimental and Broad Psychological Research
Problems of forming a representative sample of subjects during psycholinguistic research. Application of special strategies for determining the representative quantitative composition of samples by comparing the frequency of the studied characteristics.
Рубрика | Социология и обществознание |
Вид | статья |
Язык | английский |
Дата добавления | 13.10.2018 |
Размер файла | 63,3 K |
Отправить свою хорошую работу в базу знаний просто. Используйте форму, расположенную ниже
Студенты, аспиранты, молодые ученые, использующие базу знаний в своей учебе и работе, будут вам очень благодарны.
Размещено на http://www.allbest.ru/
Размещено на http://www.allbest.ru/
The Problem of Sample Representativeness for Conducting Experimental and Broad Psychological Research
Iia Gordiienko-Mytrofanova
ABSTRACT
The article is devoted to the problem of creating a representative sample of respondents in the course of experimental and broad psycholinguistic research, first of all, its quantitative composition and structure.
The primary method of research was the psycholinguistic experiment, the main stage of which is a free association experiment with «playfulness» as a stimulus word. The use of mathematical and statistical procedures confirmed the hypothesis and helped to achieve the goal.
The hypothesis of the experimental psycholinguistic research into «playfulness» stimulus as a stable personality trait was to prove the effectiveness of applying specific strategies to determine the representative quantitative composition of samples by means of comparing the frequency of the studied characteristics.
In terms of theoretical substantiation, the approaches to the determination of the quantitative composition of the experimental sample and the statistical calculations based on the results of the practical research into associative reactions to the stimulus «playfulness», it is proved that as the size of the sample decreases, the distinction grows, meaning that groups with fewer respondents do not reflect all the characteristics of the general population. The assumption was proved that the number of 100 people or close to that could not meet this requirement in the case of extensive research, the minimum size of the sample should be about 400-500 people, though this number is not always sufficient either. The sufficient quantitative composition of the experimental sample, in large general populations, varies from 400 to 1500 persons and depends on the quantitative and qualitative structure of the general population and the organizational peculiarities of the research. So, the sample should be enough to meet the requirement, which is to reflect the main tendencies and characteristics of both the general population and the purpose of the study.
Key words: experimental research, psycholinguistic experiment, respondents, general population, representative sample, strategies and criteria of sample formation, the quantitative and qualitative composition of the sample.
Статья посвящена проблематике формирования репрезентативной выборки испытуемых при проведении экспериментальных и широких психолингвистических исследований, в первую очередь, ее количественного состава и структуры.
Основным методом исследования выступил психолингвистический эксперимент, главным этапом которого является свободный ассоциативный эксперимент со словом-стимулом «игривость». Подтверждение гипотезы и достижение цели осуществлено с помощью математико-статистических процедур.
Гипотеза экспериментального психолингвистического исследования стимула «игривость», как устойчивого личностного свойства, заключалась в обосновании эффективности применения специальных стратегий определения репрезентативного количественного состава выборок путем сравнения частоты исследуемых характеристик.
Теоретически обосновывая подходы к определению количественного состава экспериментальной выборки и применяя статистические расчеты по результатам практического исследования ассоциативных реакций на стимул «игривость», доказано, что при уменьшении объема выборки различия растут, и группы с меньшим количеством исследуемых не отображают всех характеристик генеральной совокупности. Подтвердилось предположение о том, что количество в 100 человек или около того не может удовлетворить данное требование при широких исследованиях, минимальный численный состав выборки (но не всегда достаточный), должен быть около 400-500 человек. Достаточный уровень количественного состава экспериментальной выборки, при больших генеральных совокупностях, находится в интервале от 400 до 1500 человек и зависит от количественного и качественного состава генеральной совокупности и особенностей организации исследования. Таким образом, выборка должна быть достаточной для удовлетворения требований в отражении основных тенденций и характеристик, как генеральной совокупности, так и цели исследования.
Ключевые слова: экспериментальное исследование, психолингвистический эксперимент, испытуемые, генеральная совокупность, репрезентативная выборка, стратегии и критерии формирования выборки, количественный и качественный состав выборки.
Статтю присвячено проблематиці формування репрезентативної вибірки досліджуваних при проведенні експериментальних та широких психолінгвістичних досліджень, у першу чергу 'її кількісного складу та структури. strategy representative psycholinguistic
Основним методом дослідження виступив психолінгвістичний експеримент, головним етапом якого є вільний асоціативний експеримент зі словом-стимулом «грайливість». Підтвердження гіпотези й досягнення мети здійснено за допомогою математико- статистичних процедур.
Гіпотеза експериментального психолінгвістичного дослідження стимулу «грайливість» як стійкої особистісної властивості полягала в обґрунтуванні ефективності застосування спеціальних стратегій визначення репрезентативного кількісного складу вибірок через порівняння частоти досліджуваних характеристик.
За теоретичним обґрунтуванням, підходами щодо визначення кількісного складу експериментальної вибірки й статистичними розрахунками за результатами практичного дослідження асоціативних реакцій на стимул «грайливість», доведено, що при зменшенні обсягу вибірки розрізнення зростають і групи з меншою кількістю досліджуваних не відображують усіх характеристик генеральної сукупності. Підтвердилося припущення про те, що кількість у 100 осіб чи близько до того не може задовольнити цю вимогу при широких дослідженнях, мінімальний чисельний склад вибірки (але не завжди достатній) має бути близько 400-500 осіб. Достатній рівень кількісного складу експериментальної вибірки, за великих генеральних сукупностей, знаходиться в інтервалі від 400 до 1500 осіб і залежить від кількісного та якісного складу генеральної сукупності й особливостей організації дослідження. Тобто вибірка повинна бути достатньою для задоволення вимог у відображенні основних тенденцій та характеристик як генеральної сукупності, так і мети дослідження.
Ключові слова: експериментальне дослідження, психолінгвістичний експеримент, досліджувані, генеральна сукупність, репрезентативна вибірка, стратегії та критерії формування вибірки, кількісний та якісний склад вибірки.
The composition and the nature of reactions inside associative fields bring forward an idea of the context of word usage, finding a meaning which is mentally relevant for the native speakers. Probably, this is the reason why the question of the validity of associative experiments is so relevant and remains open until now.
How many respondents should be interviewed in the course of an associative experiment, so that we can talk about the validity of the results obtained?
As it has been shown in our review of psychological, psycholinguistic and sociological works, which all resort to free- association experiment (FAE) as a research tool, the quantitative characteristics of a sample, i.e. the sample size, which is used to explore the «associativity» of the stimulus, is still one of the most challenging and difficult-to-solve problems, both in methodological and in practical aspects. This problem, in its turn, is associated with a number of some other problems. In particular, it is the extremely insufficient use of broad statistical methods of varifying the hypotheses and the results of associative research, which would increase the reliability of the data obtained, and help to better outline the general population, as well as the sample, and provide the possibility of further transfer of the results over the entire population and making forecasts (Titova, 1975; Goroshko, 1997-2000; Karaulov, 2000).
G. Cherkasova wonders if it would be sufficient to rely on one hundred reactions (a sample which is most often used by researchers) in order to make conclusions on the associativity of the stimulus words, and she comes to the following conclusions summing up the quantitative studies of associative dictionaries: an increase in the number of respondents by 100 people that respond to a stimulus, increases on average by 36,41% the number of different reactions in the vocabulary entries of the direct vocabulary; «constant» reactions (no more than ten) have a relative frequency of occurrence of more than 5% in any sample (surves) of at least 100 respondents; whereas the increase in the sample size (survey) from 100 to 200 respondents causes relative frequencies of occurrence (ranks) of «constant» reactions to stabilize (Cherkasova, 2005).
O. Palkin, answering the question, «What is the bottom threshold, at which the researcher who works on the basis of associative technique has the right to speak about the reliability and completeness of the data obtained?» claims that «the bottom threshold is at the level of 100 respondents». A sample of 100 reactions per stimulus can be considered reliable and sufficiently representative provided that the sample is correctly distributed among the general population. The competent distribution of the sample implies that the number of men and women acting as respondents should be approximately equal and the survey should be conducted in different places and embrace different segments of the population to ensure a variety of opinions and minimize the element of randomness (Palkin, 2008: 82). At the same time, the author concludes that the compilation of an «ideal» sample is a hardly feasible task and it is even theoretically difficult to model such a survey that would cover all segments of the population evenly. Moreover, as he observes, this task seems to be unrealistic for practical purposes (Palkin, 2008: 83).
At the same time, O. Palkin makes a curious remark that one of the leaders of the Moscow Psycholinguistic School, E. Tarasov, «when communicating with the doubters, had a habit of having along a pen and a piece of paper, and afterward, a rough graph appeared on paper. In that graph, up to 50, the increase of new reactions was significant, then up to the value of 100 the increase continued but it was not very significant, and after the value of 100, the growth, although it was present, it was at the negligible level. He further explained that the described tendency is repeated each time when processing reactions to the next stimulus. So, he concluded that 100 responses to each stimulus would be sufficient, but it would be better to obtain a little bit more» (Palkin, 2008: 82).
In our opinion, based on the sample of 100 people, it is impossible to obtain complete and non-distorted information about the features of the general population, since such a sample does not fully reflect the characteristics of the population under study unless the latter is considered as a rather narrow, homogeneous pool of respondents with clear, specific characteristics. But in this case, the general population already requires special selection, and there is a high probability of spoiling the naturalness of the population. Moreover, in the case described above, one can observe an attempt to approximate the general population to the characteristics of the sample rather than on the contrary, although it should be the «opposite».
However, there are some scholars who hold a different point of view, which we share. Thus, O. Goroshko, a recognized Ukrainian expert in the field of associative research, in one of her overviews on the application of quantitative methods to FAE data processing, notes that the number of respondents sufficient to provide the given reliability is about 1,000 people (Goroshko, 2001).
Goroshko also notes that certain statistical laws have been established in the structure of an associative field with the help of quantitative analysis: the idea of the «golden section» and the existence of the connection between the number of reactions in an associative field and its ranks. The concept of the golden section is embodied in the proportions of numerical characteristics of the associative field (its richness and diversity), which reflect the stability of its structure as the values of language proficiency. The optimal quantitative composition of the field as the value of language proficiency one should consider the number of reactions within 400-500 units. In this case, the mentioned indicators (lexical richness and variety) approach the golden section value (Goroshko, 2001).
Yu. Karaulov believes that another statistical regularity in the structure of the associative field bears the rank index of reaction frequencies. As it was calculated, the number of ranks is directly related to the number of reactions. The larger the number of reactions is, the greater is the number of ranks in it. In this case, the described dependence is characterized by certain constancy: if the field is composed of 100 reactions, the number of ranks fluctuates within 5-8, the field of 500 reactions fits within 16-18 ranks, and the field which creates the associative norm (1,000 reactions), distributes its frequencies across 21-23 ranks (Karaulov, 2000).
Berezin mentions that the representative sample, which represents the whole population of Russia should consist of 3,600-9,000 people and 180 groups (two genders, three age groups, two educational levels, three income groups, and five types of settlements). The size of the sample depends on the number of parameters we want to achieve representativeness for. If we are satisfied with representativeness only by gender and age, then a sample of 400 people in one settlement will be more than enough. If we choose three parameters, the number of respondents should be increased to 600. Achieving the representativeness of the sample by five parameters simultaneously (gender, age, income, education, sphere of professional activity) is possible only with a sample of 1,000-1,200 people in one settlement (Berezin, 2012).
At the same time, O. Shmeliov states that the size of a representative sample cannot be determined a priori and is not reduced to a predetermined constant (or a derivative from the size of the general population). All these attempts (to calculate something without carrying out a real survey) are characteristic of abstract mathematicians, who tend to ignore the unpredictability of natural processes (Shmeliov, 2016).
We fully share the opinion of scientists that the compilation of an «ideal» sample is hardly achievable. And it cannot be feasible if the general population is represented by an experimental sample, which is formed without any strategy, since it is impossible to reflect all its characteristics, especially through its careful analysis. Of course, if the general population is not considered as a separate category of respondents (a stratum), of a determined age and gender, and rather narrow in characteristics - by gender, age, educational background, involvement in the certain activity, etc.
In this respect, a question arises if there is a minimum number of respondents which would satisfy all associative experiments? The procedure of creating a sample in every research has its internal logic and strategy, which is defined by the goals, objectives, and hypothesis of the study, and, of course, by the features of the general population (population), including its volume.
The aim of the present paper: on the basis of applied psycholinguistic research to determine and to provide rationalization for the peculiarities of creating an experimental sample of respondents - its structure, quantitative characteristics, as well as certain qualitative aspects in the course of experimental and broad psycholinguistic research.
The main objectives: to identify key concepts and criteria that reflect the characteristics (and components) of creating experimental groups; to experimentally substantiate the peculiarities of forming the quantitative and qualitative composition of samples in the conducted psycholinguistic research; to present the main strategies and methods of determining and estimating the composition when creating representative experimental groups of respondents.
Therefore, in the course of the preparation of the study and selection of subjects, carrying out the experiment and presenting the results there are some methodological complications and stereotypes regarding the sample of the respondents, which usually indicate two most common cases. The first case is ignorance of methodological requirements regarding the characteristics of creating groups of respondents, and secondly, insufficient attention to the methodological requirements for that matter. Both can be considered as significant errors, which respectively reduce the objectivity of the result, up to its inconsistency with the trends in the population.
As for the first case, it is advisable to pay attention to the methodology of scientific research within the framework of the modern paradigm, which provides sufficiently detailed requirements regarding the formation of the sample, including the most common artifacts and research errors. In the second case, it is necessary to emphasize the crucial importance of adherence to the basic requirements for the creation of an experimental sample for the possibility of further correct transfer of the results to a general population to be followed by the possibility to make predictions, which is usually the ultimate goal of the vast majority of studies, unless there is a goal to determine an essentially individual result.
Summarizing the main methodological mistakes in the formation of an experimental sample, one can distinguish the following:
most frequently, the authors of the research do not use or describe the strategy of selecting respondents, which does not allow others to understand either the technique of research of the author or what was taken as the characteristics of the general population. In the best case scenario, there is a more or less detailed description of the group of respondents that slightly clarifies the situation, but there is no complete picture;
it often happens that the number of respondents is accepted arbitrarily, and due to lack of information it is impossible to understand, why the author considers this number to be sufficient;
it is often accepted arbitrarily that the sufficient number is 100 to 200 respondents, even in the study of large general populations (for example, when transferring the result to the national level or to all people of the given age);
only a few authors analyze the quantitative and qualitative composition and usually do not explain the sufficiency of such quantity. Firstly, it violates the requirements to the description of the particular research technique and, in fact, it does not allow others to conduct a repeated research according to the author's criteria in order to check the assumptions and assertions that have been made on the basis of the obtained results;
the authors do not introduce the concept of the general population, which makes it impossible to determine the population to which the results have been transferred by the author, but it can be extremely important, especially in the case of narrowly focused and culturally colored researches;
in the description of respondents who participated in the study one way or another, there are no distinctions between such concepts as the sample of representation, representative sample, the final number of research results; in fact, researchers seldom differentiate between a sample and the respondents studied in the analysis of the final results;
besides, in the presence of control and experimental groups in the study, the way they have been formed is not always explained, but here the situation is somewhat better due to the fact that the selection of the experimental group is always explained by the logic of the particular research.
Unfortunately, in modern studies, presented by the authors in their scientific works, we observe a lot of methodological violations that undermine the objectivity of the results and do not allow us to achieve a complete understanding of the author's strategy. In this way they violate one of the basic conditions of scientific approach, i.e. the possibility to verify the results of any other researcher by means of following the author's methodology, which generally prevents any further scientific discussion.
In this paper we will not consider the peculiarities of key strategies of sample formation as they have been discussed in greater details in the methodological sources and they are not too complicated. But as far as the definition of the quantitative and qualitative composition is concerned, everything is a little more complicated, so let us try to illustrate the primary indicators and techniques we must rely upon while forming a group of respondents.
In the very beginning let us define the key concepts of our work, such as general population, sample representation, representative sample and the final number of results (respondents). The scope and purpose of the article do not allow us to carry out a detailed review of the basic concepts and approaches to these terms; therefore, we present their generalized interpretation by the authors of the study along with the main characteristics of these psychological phenomena. This material is based on a broad analysis of scientific papers and authors' experience of being practical scholars, including the authors of this work (Gordienko- Mytrofanova & Sauta, 2016).
General population - its definition is more or less clear - refers to all representatives of the population that can potentially participate in the study and which will subsequently be transferred to the results of a specific study. In this case, general population may be synonymous to the term population (Druzhinin, 2004; Atramentova & Utevska, 2007; Bochelyuk V.Y. & Bochelyuk, VV, 2008; Maksimenko & Nosenko, 2008).
Problems begin on the next level, i.e., a sample of representation. The selection of representations is that part of the general population, which utterly reflects its characteristics, it is achievable to the researcher, and from which the selection of respondents to participate in the experiment will be carried out. In this case, it should be emphasized that the sample of representation is not identical to the general population, but reflects its main characteristics, equivalent to it for key characteristics (Druzhinin, 2004; Bochelyuk, V.Y. & Bochelyuk, V.V., 2008). But it is necessary to take into account the fact that the transfer of the result is carried out on the general population, and not on the sample of representation. Therefore, when calculating the quantitative and, if necessary, qualitative composition of the sample, it is necessary to take into account the characteristics and the volume of the general population, on which the result will be transferred.
A representative sample is a group of respondents that reflect the main characteristics of the general population (that is why it is so important to determine these characteristics), and these respondents will take part in the research in this or that way. It is the group that is commonly referred to as a sample (Yadov & Semenova, 1998; Druzhinin, 2004; Atramentova & Utevska, 2007; Bochelyuk, V.Y. & Bochelyuk, VV, 2008; Maksimenko & Nosenko, 2008). It is important here to note two aspects: firstly, the quantitative composition must be sufficient to reflect the characteristics of the general population; secondly, it goes on about the selected subjects at the beginning of the work, and not about the final result. After all, any research process is dynamic, and its course and result may differ significantly from what was intended, although substantial deviations should be avoided as far as methodical procedures are concerned.
The next step is to create a control and experimental group within the representative sample, either at the beginning of the study or as a result of one of its stages. At the moment, it is not our purpose to analyze the peculiarities of creating of these groups; we only note some of the points important for understanding the peculiarities of determining the quantitative and qualitative composition of the representative sample. These features include an understanding of the fact that under conditions of sufficient equivalence of the control and experimental groups, they represent a homogeneous general population. In the case of opposing them on the basis of a key feature (gender, education, profession, etc.), we end up with two separate general populations, and this should be taken into account when creating representative samples.
Turning back to the concept of a representative sample and its involvement into direct research, we note that, as it is mentioned above, the study of the phenomenon is dynamic and even for attempts to maximally control its course, it is impossible to avoid certain methodological deviations (artifacts). One of the sources of artifacts can be the dynamics of groups. Many researchers who developed the methodological foundations of the experiment focused on these facts, which they considered it necessary to take into account when analyzing the final results. For example, D. Campbell and D. Martin distinguish the following factors that influence the composition of the sample, such as experimental dropouts (mortality), history (background), selection (selection), statistical regression, (Campbell, 1963; Martin, 2004). In our case, the most important is the experimental dropouts as a detachment subjects from the group for one or another reason. It can only be a partial participation in the study (illness, refusal to continue work), non-compliance with the instructions, which leads to distorted results (and the result has to be discarded), prohibition to use its results, etc. (Druzhinin, 2004; Martin, 2004; Bochelyuk, V.Y. & Bochelyuk, VV, 2008; Maksimenko & Nosenko, 2008).
All this leads to possible differences between the number of subjects included in the representative sample (sample) and the number of the subjects, and their results are taken into account in the final analysis, which becomes the basis for regularities and facts. And it is in this case we are talking about the final number of results (respondents), and this index can both coincide with the number of respondents in the experimental sample, and significantly differ from the predetermined quantitative composition of the sample. Therefore, when forming a sample, it is essential to consider not only the features and the volume of the general population but also the desired final number of results.
Accordingly, when estimating the volume of a representative sample, in addition to taking into account the qualitative composition of the general population, the quantity can be calculated according to the following approaches:
Selection on the basis of the estimated quantitative composition of the general population, upon which the result will be transferred.
Selection on the basis of the well-grounded desirable number of final results that can be transferred upon the general population.
In both previous cases, the estimation should take into account statistical errors.
Regarding the qualitative composition of a group or a population, within the scope of this work, we are interested only in the definition of the population that is the conceptual basis of the study. Besides, we consider the possibility of dividing one general population into several smaller ones by the purpose and hypothesis of the research. The latter is observed in the case described at the beginning of this work (which pushed us to this analysis). Namely, inside the general population, there are several key characteristics specified, which form a homogeneous sample. But first, this method does not allow us to objectively represent the general population, since it is not homogeneous and is already a violation of methodological requirements (if we want to get a generalized result). So, it refers to the balancing of a limited sample of criteria, which requires an equivalent transfer of the result. Secondly, there is certain subjectivity in the selection of criteria. To avoid this, it is usually recommended to apply randomization as a way to level external variables by way of mixing. In addition, by means of defining some clear categories within a very limited sample, we invariably divide the general population into separate smaller sets. However, in this case, each of them will be represented by too few respondents.
Methods and techniques of research
The main method of the conducted research is experimental, in particular, a psycholinguistic experiment, having been aimed at description the psycholinguistic meaning of the word «playfulness» as the most adequate and reliable model of systemic significance that reflects the reality of linguistic consciousness (Sternin, 2011: 188). The main stage of the research was the free association experiment (with the word-stimulus «playfulness») as the most elaborated technique of semantic analysis. It is worth mentioning that the choice of this stimulus is not random. Over the last fifteen years it has been possible to observe a radical reorientation from the fragmentary mention of playfulness in psychological texts to the foundation of the latter in psychology as a stable personality trait (Guitard et al., 2005; Yarnal & Qian, 2011; Proyer, 2012-2014b; Shen et al., 2014; Waldman-Levi et al., 2015; Bar- Haim Erez et al., 2016; Yue et al., 2016).
As additional methods the surveys have been applied, with the aim of refinement of a free associative experiment results; questioning for the specification of the characteristics of the sample; the psychodiagnostic method, in particular, the methodology of the «Differential-Diagnostic Questionnaire (DDQ)» by E. Klimov for dividing the experimental sample into stratum. As a mathematical-statistical method, the analysis of the results of the study used frequency and cluster analysis, the ф* criterion (Fischer z-transformation), which allowed to identify tendencies in the distribution of associations of experimental groups and strata to achieve the intended goals.
The hypothesis of experimental psycholinguistic research of the «playfulness» stimulus being a stable personality trait is to substantiate the effectiveness of special strategies of determination of the representative quantitative composition of the samples by comparing the frequency of the studied characteristics.
The actual psycholinguistic tasks are aimed at showing that «playfulness» is a relevant lexeme in the linguistic consciousness of the subjects; gender, age and professional factors influence the stimulus of «playfulness» only on the far periphery.
In our study, we define general population as the inhabitants of Ukraine (aged 18 to 75), whose linguistic consciousness is characterized by knowledge (including understanding) of Russian. This is primarily because the association tasks have been presented in Russian, and it was also assumed that in the territory of modern post-Soviet Ukraine, the majority of the population either fluent in Russian or understands it sufficiently freely to produce associations. The answer was not limited and was not emphasized on the language of the answer, but the vast majority of respondents provided the answer in Russian regardless of the region, there was no refusal or an excuse on the misunderstanding of the task.
According to statistics as of 01.01.2015 (the State Statistics Service of Ukraine website): the main general population of adult people, in total (from 18 to 75) is 35,145,000 people, 82% of the total population (42,759,700 people). This includes Russian-speaking, Ukrainian-speaking, bilingual, and those who understand Russian and able to express themselves in this language.
There are 460 cities, 885 urban-type settlements, as well as 28,385 other settlements (including settlements) in Ukraine. The country is divided into 24 oblasts. The urban population accounts for 62,13%, and the rural population is 37,87%. Ukrainians make up 77,8% of the population of Ukraine, and Russians - 17,3%. Belarusians, Moldovans, Crimean Tatars, Bulgarians, Hungarians, Romanians, Poles, Jews, Armenians, Greeks, Tatars and other ethnic groups, who were also involved in the research, live on the territory of the country.
Besides, according to the hypothesis of the study, further analysis of the results was foreseen for such indicators as age, gender, profession. This condition led to the division of the main general population into smaller ones according to the following key criteria.
Gender criterion: male - 49,09%, female - 50,91% of Ukraine's population.
Age criterion - youth (from 18 to 35 yo) is estimated to be 10,448,900 people; maturity (from 36 to 60 yo) - 15,366,100 people. The elderly (from 61 to 75 yo) - approximately 9,330,400 people - are not presented in this research as our work with this as a group is still in progress.
Here we consider it expedient to explain why at this stage of the study we have combined youth (17/18-21) and young age (21/22-35) into one age group. Systematic studies of the intellectual functions of the adults into the age range 18-35 years, conducted under the guidance of B. Ananiev, showed that the most considerable changes occur in short-term verbal memory, dealing with visual and auditory modality. The highest development rates are from the age of 18 to 30 years; then there is a slight decline. In turn, figurative memory is subjected to less change with age, and verbal impressions of long-term memory are characterized by a high continuity of indicators at the age of 18-35 years (Ananiev, 2001).
Among respondents, the key social categories (strata) were widely represented: in marital status (married, not married, divorced, cohabitation with a partner, engaged, widowed, having relationships, etc.). By level of education (with higher education, with incomplete higher education, secondary education, etc.). By the level of professional activity (students, employed, unemployed, retired, etc.).
Taking into account this data and the fact that smaller general aggregates (strata) will be singled out in the course of the analysis, the selection of the subjects in the sample has been carried out both quantitatively and qualitatively. The main strategy of selection is a random selection (randomization), stratification with quantitative balancing.
Sample 1 reflects the characteristics of the main general population. The purpose is to identify general trends in associations. At the initial stage of the study, all 24 oblasts of Ukraine were covered, as well as all types of settlements, together with 40 cities. It is this available part of the main general population that constituted the sample of representation. Investigators presented the subjects with word-stimulus «playfulness» and suggested to express the first (five) words that came to mind. The task was offered in the visual modality. According to the results of the activity, there were 2,902 respondents in Sample 1. General parameters of the sample: males - 1,187 persons (40,9%), age from 18 to 35 - 755, from 36 to 60 - 412, over 61 - 20; female - 1,715 people (59.1%), age from 18 to 35 - 1,310, from 36 to 60 - 381, over 61 - 24, according to characteristics close to the general population.
When the object of research or the general population is sufficiently large (for example, the population of Ukraine, districts, cities, families, etc.) and is multileveled, the sample is also of a multi-stage nature, the process of the sample construction proceeds in several stages. At the initial and every further intermediate stage, the objects of representation (selection units) are first selected, and only in the final stage the units of observation are added (for example, the oblast - the district - the city - the quarter - the house - the family - the person - the object). The main disadvantage of a multi-stage sample is the fact that the larger the number of stages is, the greater is the magnitude of the error. And for the preservation of objectivity, it is necessary to divide it into such groups of units that will fully reproduce its heterogeneity and, accordingly, the general population.
As a rule, at the first stage of building a multi-stage sampling, the stratification of the general population is applied. The strata are inherently homogeneous parts of the totality that are different from each other and which completely cover the totality (Maksimenko & Nosenko: 103). For example, the territory is divided into political and administrative regions, each of them - into settlements, of urban and rural types; the cities are divided by size, can be separated by age groups, social status, profession, etc. Stratification is appropriate to continue until the certain uniform distribution of the feature within each stratum is achieved. In fact, stratification is the procedure for the formation of several separate random subsamples from a certain population, which then are combined into one.
The number of units that are selected from each stratum can be determined in several ways:
From each stratum, some units are selected which is proportional to the volume of the stratum:
ni = n (Ni / N); (1)
where n is the sample size, N is the volume of the general population, Ni is the volume of an i-stratum.
For example, in our case, the general population - adult age, in total (from 18 to 75) is 35,145,000 people, when stratified by the age criterion, first stratum - youth (from 18 to 35) is approximately - 10,448,900 people, the number of respondents in Sample 1 was 2,902 persons, the respondents selected from the first stratum should be 863 persons (in our study this stratum is 400 people).
The number of units in each sub-sample is proportional to the standard deviation si of the i-stratum (this indicator is being calculated on the basis of the results of the study sample):
ni = n (Ni si / S Njsj); J = 1; (2)
where n is the sample size, Ni is the volume of the i-stratum, l is the number of groups.
The same number of units is selected from each stratum:
N1 = N2 = N3 = .... = Nk; (3)
In this case, the divergence in the number of different strata of the general population is ignored, but at the same time, there is an opportunity to sufficiently represent smaller strata in the sample. This very method, taking into account the width of the general population, was chosen to be the major one.
If at a certain stage of stratification uniformity is achieved in the distribution of a characteristic, which is interesting for the researcher. As a rule, the procedure of allocating the nests or series (serial sample) is usually carried out in the future. Unlike the strata, the nests are similar to each other; they are relatively homogeneous by certain attributes (Cochran, 1974). As a rule, the series is the smallest strata (if they are similar to each other on a certain significant sign): family, profession, level of education, etc. Selected samples of the series (nests) are subject to continuous or selective observation. It should be noted that, under the condition of a constant examination of the selected nests, the homogeneity of objects within each of them too understate the dispersion of a feature in comparison with the general population (for example, pupils of a particular school class or workers of a certain collective and, moreover, members of one family, would have many standard features).
It is possible to make a selection of a certain nest (series) within a stratum in the following ways:
on a random basis.
on the principle of the typical nest.
On the last stage of the actual implementation of random or systematic selection, as a rule, there appears a task to select a particular respondent from a circle of potentially possible.
At the end of these calculations, it is necessary to focus attention on the fact that the use of multistage samples results in one of the paradoxes of experimental psychology - on the one hand, with the help of stratification we distinguish groups from the general population, but due to the homogeneity of the strata, we are moving away from the heterogeneity of the population. This leads to the fact that in the case of finding the distinctions in executions, we will no longer be able to transfer the result to the main population, and the transfer should be
carried out by the characteristics of the strata. If we want to analyze
the peculiarities of the general population, we will have to return to Sample 1 (2,902 persons) that underwent stratified sampling.
Thus, using stratification, we obtained a general multilevel population and a representative sample out of that and were already formed from its number of samples of respondents depending on the aim.
Sample 2. The purpose is to determine the influence of gender and age on the nature and composition of associations.
Stratification of the general population has been carried out, which caused the sample to become multistage.
The sample according to the criteria «gender» (males and females - included both age groups) and «age» (18-35 and 36-60) included 1,600 respondents: 400 people in each sub-group of respondents. Within the framework of our research, we reduced the sample size from the point of view that the response rates in the samples of 1,600 and 3,000 respondents do not statistically differ (experimental data is given below). 400 people have been selected for each stratum in accordance with the above described approach of stratification (formula 3) together with the actual absence of significant statistical distinctions between samples of 1,600 and 400 people on the verge of non-significance/significance (see Table 3). This fact of approximation to the appearance of distinctions between the totality (a sample of representation) is objective, because, as it was mentioned above, in shows greater homogeneity.
Sample 3. The purpose is to determine the influence of the profession on the nature and structure of associations.
The next stage of the research was to identify common and specific features of respondents belonging to different professional backgrounds (types of occupations): «person - nature», «person - person», «person - sign systems», «person - technology», «person - artistic image», which are defined on the basis of the methodology developed by I. Klimov and his Differential-Diagnostic Questionnaire (DDQ), within the strata of youth (group 18-35), which is a period when a person tries to find his or her self. The sample comprised 500 people: 100 people in each «type of occupation», males and females being equally represented. In this case, we used the next level of the multilevel population - the allocation of nests. Regarding the quantitative composition, the allocation of nests implies a high degree of homogeneity in comparison with the general population, so that the conformity is not emphasized, which is what was proved by the statistical authenticity of the distinctions (see Table 3).
Now, after having analyzed the quantitative and qualitative composition of the general population and the samples of our research, in order to prove the validity of our number and show those failures, that often remain unnoticed, we present several variants.
Unfortunately, in modern psychological studies, the topic of methods and strategies of quantitative (and to a certain extent qualitative) formation of experimental samples is not sufficiently revealed and filling the gap is the purpose of this work. The orientation and affinity of this issue with similar in other scientific fields allowed us to rely not only on the work of psychologists, but also on the peculiarities of the disclosure of this issue in linguistics, sociology, political science, medical research, physiology, economics, and rely on mathematical and statistical developments. And it is this breadth of coverage of the problem that allows you to most effectively outline the main ways to solve it.
First, let us return to the concepts of large and small populations. Sometimes the quantitative characteristics of the sample are being taken arbitrarily, about 5-10% of its volume. Based on these notions, the research identifies the following indicative quantitative characteristics:
the small general population of up to 2,000 people - a sample of minimum 50 people;
the large general population of 5,000 people - a sample of minimum 200 people.
To demonstrate a certain validity of these data, give an example based on the most accessible and widespread group of the respondents: the pupils and the students. So often the researches carried out within the limits of one settlement with an average (city of regional significance) or large (city of regional significance, sometimes some small settlements) with a small population. In this case, we deal with a group of 3-10 schools within the settlement (Merefa, Kharkiv region, district center, with a population of about 25 thousand people - 5 schools and 2 lyceums; the city of Krasnograd, district center, population of about 23 thousand people, 5 secondary schools; one art school and one sports school) and more in big cities, but in the latter case a separate district (for example, not the largest district of the city of Kharkiv - Novobavarsky, with the population of about 110 thousand people, represented by 19 schools).
Coming back to quantitative indicators, regarding the composition of one school, if one speaks a single degree, for example, a junior school, it is usually represented by two parallels of 25 to 30 people from each form from 1 to 4, in total it is about 240 junior pupils. In the case of a separate school, there are 25 to 30 people (10%) in the sample. If we talk about the whole city (a territorial unit, a conglomeration) - this is in total about 1,000-2,500 thousand junior students - the sample can range from 50 to 200 people, which is already determined additionally, based on the purpose and hypothesis of the research.
It is more complicated than with large populations - large cities, oblast, country because even at lower percentages, quantitative indicators of the sample shows significant growth. In this case, we observe a tendency, which is directly opposite to small communities - in which, for a small number of respondents, the result is closer to the individual. This is a tendency to an excessive number of respondents - even at the lower limit, a minimum sample size of 250 people, or even more. Therefore, it should be noted that this is a very rough, indicative calculation, for precise studies, which require a special calculation or statistical survey, is not desirable.
For accurate, methodologically grounded research it is better to use special formulas to estimate the quantitative composition of the sample. The calculation, as it was noted, can be carried out in two ways - from the general population and from the number of desired final results. And these failures require the introduction of additional concepts and indicators, such as the number of objects depends on the determined conditions. Among these conditions is the accuracy of the study, the required probability of forecast, the power of the criterion, and so on. There should be large enough objects within the sample to form a correct idea of the general population. The more objects are there, the less statistical error is, but after reaching a certain amount of further increase in the sample does not significantly affect the result (Atramentova & Utevska, 2007).
Generalized methodological and terminological tools for calculating a sample and sample errors have not been fully formed yet in the modern methodology of experimental research. And against the background of the well-established mathematical and statistical apparatus, one can find various terminological and symbolic designations of the same criteria.
In order to provide rationalization for our research, we used the most widely-spread parameters:
Percentage of responses and losses during registration. Usually the acceptable volume of losses ranges between 5% and 20%, meaning that the volume of the experimental sample increases by 1,05-1,2 times;
Statistical power of research. It is arbitrarily considered as 100% of the result minus error probability. The power is usually taken as 80-99,9%. It can't be lower than 80%, but if it is absolutely important to make sure that the research does not miss any probable effect, the power of research must be no less than 90%;
Statistical error. As a rule, the level of significance is accepted as 5% and lower (the equivalent p=0,05);
Confidence interval takes into consideration the statistical error. For example, if a certain feature was detected with 70% of respondents, the real position of this feature is within the range of 65-75%, meaning that the error is +/- 5%.
These are the key criteria, but there are a number of other criteria that enhance the reliability of data:
Error of sample survey (accuracy) - difference (deviation) between the values of the parameters in general population and its sample value (Yadov & Semenova, 1998);
Share of feature (expected frequency of the result) - expected share of feature, for which the error is estimated. If no information on the share of feature is available, it is necessary to use a value that equals 50% when the maximum error occurs.
...Подобные документы
Problems in school and with parents. Friendship and love. Education as a great figure in our society. The structure of employed young people in Russia. Taking drugs and smoking as the first serious and actual problem. Informal movements or subcultures.
контрольная работа [178,7 K], добавлен 31.08.2014Overpopulation, pollution, Global Warming, Stupidity, Obesity, Habitat Destruction, Species Extinction, Religion. The influence of unemployment in America on the economy. The interaction of society with other societies, the emergence of global problems.
реферат [21,1 K], добавлен 19.04.2013Teenagers have a particular relationship with the world. They always try to express their individuality. Popular way of expressing the individuality. Teenagers join the group. The reasons of the problems. But are there only problems in teens life?
презентация [1,1 M], добавлен 26.05.2014The essence of social research communities and their development and functioning. Basic social theory of the XIX century. The main idea of Spencer. The index measuring inequality in income distribution Pareto. The principle of social action for Weber.
реферат [32,5 K], добавлен 09.12.2008Migration policies: The legal framework. The evolution of migration flows. Percentage of Portuguese emigration by district. Key migrant characteristics. Characteristics of legal migrants. Return migration. Portuguese emigration by destination, 1950-1988.
реферат [65,6 K], добавлен 25.06.2010Study the opinion of elderly people and young people about youth culture. Subculture as a group of people with the same interests and views on life. Passion for today's youth to heavy music, computers, dance parties and special styles of clothing.
презентация [654,6 K], добавлен 28.10.2014Race discriminations on ethnicity backgrounds. The Globalization and Racism in Media Age. African American writers about racism. Comparative analysis of the novel "To Kill a Mockingbird" Harper Lee and story "Going to Meet The Man" by James Baldwin.
дипломная работа [135,9 K], добавлен 29.03.2012The study of human populations. Demographic prognoses. The contemplation about future social developments. The population increase. Life expectancy. The international migration. The return migration of highly skilled workers to their home countries.
реферат [20,6 K], добавлен 24.07.2014The concept, definition, typology, characteristics of social institute. The functions of social institution: overt and latent. The main institution of society: structural elements. Social institutions of policy, economy, science and education, religion.
курсовая работа [22,2 K], добавлен 21.04.2014Description situation of the drugs in the world. Factors and tendencies of development of drugs business. Analysis kinds of drugs, their stages of manufacture and territory of sale. Interrelation of drugs business with other global problems of mankind.
курсовая работа [38,9 K], добавлен 13.09.2010Planning a research study. Explanation, as an ability to give a good theoretical background of the problem, foresee what can happen later and introduce a way of solution. Identifying a significant research problem. Conducting a pilot and the main study.
реферат [26,5 K], добавлен 01.04.2012Social network theory and network effect. Six degrees of separation. Three degrees of influence. Habit-forming mobile products. Geo-targeting trend technology. Concept of the financial bubble. Quantitative research method, qualitative research.
дипломная работа [3,0 M], добавлен 30.12.2015The collection and analysis of information with a view of improving the business marketing activities. Qualitative & Quantitative Research. Interviews, Desk Research, Test Trial. Search Engines. Group interviews and focus groups, Secondary research.
реферат [12,5 K], добавлен 17.02.2013Research methods are strategies or techniques to conduct a systematic research. To collect primary data four main methods are used: survey, observation, document analysis and experiment. Several problems can arise when using questionnaire. Interviewing.
реферат [16,7 K], добавлен 18.01.2009Stages and types of an applied sociological research. Sociological research process. Now researchers may formulate a hypothesis – a statement of the relationship between two or more concepts, the object’s structure, or possible ways to solve a problem.
реферат [15,6 K], добавлен 18.01.2009Comparative analysis of acronyms in English business registers: spoken, fiction, magazine, newspaper, non-academic, misc. Productivity acronyms as the most difficult problem in translation. The frequency of acronym formation in British National Corpus.
курсовая работа [145,5 K], добавлен 01.03.2015The problem of evaluation, self-assessment of personality as a psychological category. Factors of formation evaluation and self-esteem of children of primary school age. An experimental study of characteristics evaluation and self-esteem of junior pupils.
курсовая работа [28,6 K], добавлен 19.05.2011Concept of methods of research. Value of introduction of laboratory experiment and measurement in psychology. Supervision and experiment, their features. Methods of processing and interpretation of results of experiments. Rules of the conversation.
реферат [19,1 K], добавлен 31.10.2011The history of the company. Entering the market of pastas and the present position of the company. The problem of the company. The marketing research. The history of the market of pastas of Saint Petersburg and its present state.
курсовая работа [28,2 K], добавлен 03.11.2003Diesel fuel is any fuel used in diesel engines. Chemical composition and cetane number. Boiling point and freezing point of representative diesel fuel hydrocarbons. Disadvantages of Diesel Fuel. Environment hazards of sulfur. Fuel value and price.
реферат [39,2 K], добавлен 25.05.2012