問題一覧
1
, statistic used to estimate how large a difference between two scores should be before the difference is considered statistically significant.
standard error of the difference
2
, the raw data from a test’s standardization sample converted to percentile form.
percentile norms
3
that which others are compared to or evaluated against. ● what is usual, generally accepted, or commonly employed.
standard
4
the need for varied, novel, and complex sensations and experiences and the willingness to take physical and social risks for the sake of such experiences
sensation seeking
5
absence of primacy of male or female characteristics.
androgynous
6
, process of developing a sample based on specific subgroups of a population
stratified sampling
7
, a long-standing assumption that factors other than what a test attempts to measure will influence performance on the test. ● traditionally: something that is more than expected; it is actually a component of the measurement process.
error
8
individuals who do not classify themselves on the masculine-feminine or male-female continuum.
gender non-binary
9
, process of developing a sample based on specific subgroups of a population in which every member has the same chance of being included in the sample.
stratified-random sampling
10
, portion of the universe of people deemed to be representative of the whole population.
sample
11
Assumption ● a world without tests would most likely be more a nightmare than a dream.
Assumption 7: Testing and Assessment Offer Powerful Benefits to Society
12
, a standard on which a judgment or decision may be based.
criterion
13
, process of selecting the portion of the universe deemed to be representative of the whole population
Sampling
14
consist of descriptive statistics based on a group of testtakers in a given period of time rather than norms obtained by formal sampling methods
user norms
15
, procedure for comparing scores on two or more tests that entails calculating percentile norms for each test and then identifying the score on each test that corresponds to the percentile
equipercentile method
16
an observable action or the product of an observable action, including test- or assessment-related responses.
overt behavior
17
, system of scoring wherein the distribution of scores obtained on the test from one group of testtakers (the fixed reference group) is used as the basis for the calculation of test scores for future administrations; the SAT (1926) and the GRE are scored this way
fixed reference group scoring system
18
, we arbitrarily select some sample because we believe it to be representative of the population
purposive sample
19
____, component of a test score attributable to sources other than the trait or ability measured.
error variance
20
“Whatever exists at all exists in some amount. To know it thoroughly involves knowing its quantity as well as its quality.”
e.l. thorndike
21
, provide normative information with respect to the local population’s performance on some test. ● developed by test users themselves ● Ex. individual high schools may wish to develop their own school norms
local norms
22
an informed, scientific concept developed or constructed to describe or explain behavior.
construct
23
process of administering a test to a representative sample of testtakers for the purpose of establishing norms ● a test is said to be standardized when it has clearly specified procedures for administration and scoring, typically including normative data.
standardization
24
one group of testtakers where the distribution scores are obtained.
fixed reference group
25
developed on the basis of any trait, ability, skill, or other characteristic that is presumed to develop, deteriorate, or otherwise be affected by chronological age, school grade, or stage of life.
developmental norms
26
the controversial practice of norming on the basis of race or ethnic background.
race norming
27
distribution of raw scores, to the number of items that were answered correctly multiplied by 100 and divided by the total number of items.
percentage correct
28
, in regression, an estimate of the degree of error involved in predicting the value of one variable from another
standard error of estimate
29
making or transforming something into something that can serve as a basis of comparison or judgment.
to standardize
30
, a reminder that a trait is not expected to be manifested in behavior 100% of the time
relatively enduring
31
consistency of the measuring tool: the precision with which the test measures and the extent to which error is present in measurements. ● yields the same numerical measurement every time it measures the same thing under the same conditions.
reliability
32
process of deriving norms. ● may be modified to describe a particular type of norm derivation
norming
33
● competent test users understand and appreciate the limitations of the tests they use as well as how those limitations might be compensated for by data from other sources.
Assumption 4: All Tests Have Limits and Imperfections
34
, derived from a normative sample that was nationally representative of the population at the time the norming study was conducted. ● obtained by testing large numbers of people representative of different variables such as age, gender, racial/ethnic background, socioeconomic strata.
national norms
35
if it does measure what it purports to measure. ● Ex. a test of reaction time is a ___ test if it accurately measures reaction time
validity
36
the test performance data of a particular group of testtakers that are designed for use as a reference when evaluating or interpreting individual test scores. ● provides a standard with which the results of measurement can be compared. ● literature: behavior that is usual, average, normal, standard, expected, or typical
norms
37
the average performance of different samples of testtakers who were at various ages at the time the test was administered.
age norms
38
, statistic used to estimate the extent to which an observed score deviates from a true score
standard error of measurement
39
designed to indicate the average test performance of testtakers in a given school grade ● developed by administering the test to representative samples of children over a range of consecutive grade levels (such as first through sixth grades).
grade norms
40
seeks to identify people who are high or low on this trait.
sensation-seeking scale
41
developmental norms
age norms, grade norms
42
covers a wide range of possible characteristics.
psychological trait
43
, permits the conversion of raw scores on the new version of the test into fixed reference group scores.
anchoring
44
an expression of the percentage of people whose score on a test or measure falls below a particular raw score. ● converted score that refers to a percentage of testtakers.
percentile
45
, an equivalency table for scores on two nationally standardized tests designed to measure the same thing ● provide some stability to test scores by anchoring them to other test scores.
national anchor norms
46
wherein a trait is measured by a series of test items. ● each response to a test item is converted to a number according to a test “key” (e.g., correct = 1 and incorrect = 0).
cumulative scoring
47
, process of arbitrarily selecting some people to be part of a sample because they are readily available, not because they are most representative of the population being studied
incidental sampling
48
, measure of sampling error.
standard error of the mean
49
evaluating the test score in relation to other scores on the same test ● area of focus is how an individual performed relative to other people who took the test.
norm-referenced
50
controversial than the 6
Assumption 6: Unfair and Biased Assessment Procedures Can Be Identified and Reformed
51
method of evaluation and a way of deriving meaning from test scores by evaluating an individual’s score with reference to a set standard. ● area of focus is the testtaker’s performance: what the testtaker can or cannot do; what the testtaker has or has not learned; whether the testtaker does or does not meet specified criteria
criterion-referenced testing and assessment
52
method of evaluation and a way of deriving meaning from test scores by evaluating an individual testtaker’s score and comparing it to scores of a group of testtakers. ● goal is to yield information on a testtaker’s standing or ranking relative to some comparison group of testtakers.
norm-referenced testing and assessment
53
also distinguish one person from another but are relatively less enduring.
states
54
any distinguishable, relatively enduring way in which one individual varies from another.
trait
55
norms for any defined group within a larger group ● what results from such segmentation are more narrow
subgroup norms
56
psychological tests may be used not to predict behavior but to ______ it—, to aid in the understanding of behavior that has already taken place. Give the assumption.
postdict
57
group of people presumed to be representative of the universe of people who may take a particular test and whose performance data on that test may be used as a reference source or context for evaluating individual test scores
normative sample