PSYCHOLOGICAL ASSESSMENT MIDTERM REVIEWER

PSYCHOLOGICAL ASSESSMENT MIDTERM REVIEWER
88問 • 2年前
  • GIAN CARLO FIESTA
  • 通報

    問題一覧

  • 1

    Which of the following criteria are essential for a good test?

    All of the above

  • 2

    Test users often speak of the psychometric soundness of tests, key aspects of which are:

    Fairness

  • 3

    This includes the notion that each individual measurement has an element of error such as observer's error, environmental changes, participant's changes etc.  It is the stability or consistency of the measurement.

    Reliability

  • 4

    A________ is defined as one on which test takers will fall in the same positions relative to each other

    Reliable Test

  • 5

    The extent to which measurements are consistent or repeatable and it is the extent to which measurements differ from occasion to occasion as a function of measurement error.

    Reliable Test

  • 6

    To check for reliability or Test of Reliability /, we use____________

    Coefficient Alpha

  • 7

    Reliability Coefficient should not go beyond_______

    (+/- 1)

  • 8

    The higher the coefficient alpha, __________

    The higher reliability

  • 9

    ______ happen when the test scores gain reliability as the number of items increases the higher the coefficient alpha, the higher the reliability.

    Reliability Coefficient

  • 10

    Theory behind reliability analysis, reliability analyses assume that test scores reflect two factors:

    Stable characteristics of the individual & Chance features of the individual

  • 11

    The true characteristics of the individua

    Stable characteristics of the individual

  • 12

    Random measurement of error

    Chance features of the individual

  • 13

    A value that, according to classical test theory, genuinely reflects an individual’s ability (or trait) level as measured by a particular test

    True score

  • 14

    If you are interested in the truth independent of measurement, you are not looking for the so-called true score, but what psychologists call the________________

    Construct score

  • 15

    The Classical Test Theory (CTT) formula

    X = T + e

  • 16

    X = T + e means...

    X = observed score, T = true score, e = measurement error

  • 17

    ___________________ the value of e should be close to 0 and the value of T should close to the actual test score x. Simply, most of the test scores should result from the measurement of true characteristics T.

    Using the classical test theory (CTT) in a reliable test

  • 18

    Proportion of test score reflecting person's true characteristics

    T/X

  • 19

    Proportion of test score reflecting random error

    X/T

  • 20

    Refers to the inherent uncertainty associated with any measurement, even after care has been taken to minimize preventable mistakes. Estimates of a quantity differ each time a measurement is taken—if only slightly. These fluctuations in measurement occur even when procedures are followed perfectly, and no obvious mistakes are made.

    Measurement error

  • 21

    Variance from true differences is________________

    True variance

  • 22

    Variance from irrelevant, random sources is__________

    Error variance

  • 23

    We can indirectly estimate how much the true score influences the observed score by___________

    Measuring the variability of test scores

  • 24

    The term reliability refers to the proportion of the total variance attributed to true variance. The greater the proportion of the total variance attributed to true variance, the more reliable the test.

    TRUE

  • 25

    Measurement error can be___________or___________

    systematic or random

  • 26

    A source of error in measuring a targeted variable, caused by unpredictable fluctuations and inconsistencies of other variables in the measurement process

    Random error

  • 27

    A source of error in measuring a variable that is typically constant and proportionate to what is presumed to be the true value of the variable being measured

    Systematic error

  • 28

    Test construction , Test administration, Test scoring and interpretation, Other sources of error are the ________

    Sources of error variance

  • 29

    A type of sources of error variance that has a item sampling/content sampling and the variety of the subject matter contained in the items; frequently referred to in the context of the variation between individual test items in a test or between test items in two or more tests.

    Test construction

  • 30

    Sources of error variance that occur during test administration may influence the testtaker’s attention or motivation.

    Test administration

  • 31

    Source of error variance that occur during test due to the factor of room temperature, level of lighting, and amount of ventilation and noise etc. - - - -

    Test environment

  • 32

    Source of error variance that occur during test due to the factor of pressing emotional problems, physical discomfort, lack of sleep, casual life experiences, illness, and changes in mood or mental state.

    test taker variables

  • 33

    Source of error variance that occur during test due to the factor of examiner’s physical appearance and demeanor— even the presence or absence of an examiner - - - -

    examiner-related variables

  • 34

    A type of sources of error variance that the advent of computer scoring and a growing reliance on objective, computer-scorable items have virtually eliminated error variance caused by scorer differences.

    Test scoring and interpretation

  • 35

    If subjectivity is involved in scoring, then the scorer (or rater)____________

    can be a source of error variance.

  • 36

    Reliability analysis correlates performance on two interval scale measures and uses that correlation to indicate the extent of true score differences by using some methods and these are:

    Test-retest method, Parallel/Alternate forms method, Split-half method, Inter-item consistency, and  Inter-scorer Reliability.

  • 37

    In measuring reliability____________reliability estimates are used to evaluate the error associated with administering a test at two different times. This type of analysis is of value when we measure "traits" or characteristics that do not change over time.

    Test-retest method

  • 38

    Test-retest method ideally, 6 months or more interval (when the interval between testing is greater than six months, estimates are called___________

    Coefficient of Stability

  • 39

    In measuring reliability____________reliability compares two equivalent forms of a test that measure the same attribute, the two forms use different items; however, the rules used to select items of a particular difficulty level are the same.

    Parallel / Alternate Forms method reliability

  • 40

    In measuring reliability____________, a test is given some and divided into halves that are scored separately and the results of one half of the test are then compared with the results of the other. Using the odd-even system, whereby one subscore is obtained for the odd number items in the test and another for the even-number items.

    Split-half method reliability

  • 41

    In Split-half method reliability we used__________ to allows us to estimate what the correlation between the two halves would have been if each half had been the length of the whole test.

    Spearman Brown formula

  • 42

    In measuring the reliability we used___________to know the degree of correlation among all the items on a scale, calculated using a single administration of a single test form and it is useful in assessing the homogeneity of the test.

    Inter-item Consistency

  • 43

    Methods in assessing internal consistency or Inter-item Consistency l Cronbach alpha/Coefficient Alpha ll R-20 lll KR-21

    all of the above

  • 44

    When measuring reliability using Inter-item Consistency_______is used to estimates the internal consistency of test in which the items are non-dichotomous or there is no right or wrong answer. Typically range from 0 to 1 (negative values are theoretically impossible)

    Coefficient alpha

  • 45

    It developed a more general reliability estimate which he called coefficient alpha

    Cronbach

  • 46

    When measuring reliability using Inter-item Consistency____________is used to calculates for the internal consistency of tests with dichotomous items with varying difficulty.

    Kuder-Richardson-20

  • 47

    When measuring reliability using Inter-item Consistency____________is used to calculates for internal consistency of tests with dichotomous items with equal/same difficulty - - - -

    Kuder-Richardson-21

  • 48

    In measuring reliability_____________ is a degree of agreement or consistency between two or more scorers (or judges, or raters) with regard to a particular measure. The judges rate the answers of the examinee and often used for creativity or projective tests.

    Inter-scorer reliability

  • 49

    Method: Test-Retest  NOF___NOS______Sources of Error/Variance_____

    1 - 2 - Changes over time

  • 50

    Method: Alternate Forms (Immediate) NOF___NOS______Sources of Error/Variance_____ - - - -

    2 - 1 - Item Sampling

  • 51

    Method: Alternate Forms (Delayed) NOF___NOS______Sources of Error/Variance_____

    2 - 2- Item sampling & Changes over time

  • 52

    Method: Split-Half NOF___NOS______Sources of Error/Variance_____

    1 - 1 - Item sample & Nature of split

  • 53

    Method: Coefficient alpha, KR NOF___NOS______Sources of Error/Variance_____ - - - -

    1 - 1 - Item sampling, Test Heterogeneity

  • 54

    Method: Inter-rater NOF___NOS______Sources of Error/Variance_____ - - - -

    1 - 1 - Scorer Differences

  • 55

    For critical decisions, such as medical diagnoses or hiring selections, what is the minimum acceptable coefficient of reliability.

    Coefficient of .95 or higher (Grade A)

  • 56

    In educational settings, what is the recommended range for the coefficient of reliability to ensure a reasonable level of consistency in test scores? - - - -

    Coefficient in the .80s (Grade B)

  • 57

    For non-critical decisions, such as general research studies or surveys, what is the lowest acceptable range for the coefficient of reliability?

    Coefficient in the .65 to .70 range (Weak, "barely passing" grade)

  • 58

    The nature of the test where items are functionally uniform throughout.

    Homogeneous

  • 59

    The nature of the test where an estimate of internal consistency might be low relative to a more appropriate estimate of test-retest reliability

    Heterogeneous

  • 60

    As long as the items are positively correlated, adding many items eventually results in high internal consistency coefficients, homogeneous or not.

    True

  • 61

    The nature of the test where a trait, state, or ability presumed to be ever-changing as a function of situational and cognitive experiences .

    Dynamic Characteristics

  • 62

    The nature of the test where a trait, state, or ability presumed to be relatively unchanging (such as intelligence)

    Static characteristic

  • 63

    A situation in nature of the test when the variance of either variable in a correlational analysis is restricted by the sampling procedure used, then the resulting correlation coefficient tends to be lower.

    Restriction or inflation of range (restriction/inflation of variance)

  • 64

    If the variance of either variable in a correlational analysis is inflated by the sampling procedure, then the resulting correlation coefficient tends to be lower.

    False

  • 65

    It is generally contains items of uniform level of difficulty (typically uniformly low) so that, when given generous time limits, all testtakers should be able to complete all the test items correctly.

    Speed test

  • 66

    When a time limit is long enough to allow testtakers to attempt all items, and if some items are so difficult that no testtaker is able to obtain a perfect score

    Power test

  • 67

    Which of the following is not a method for estimating the reliability of a speed test based on performance from two independent testing periods?

    Split-quarter reliability

  • 68

    It is designed to provide an indication of where a testtaker stands with respect to some variable or criterion, such as an educational or a vocational objective. This nature of the test wants to know  how different the scores are from one another is seldom a focus of interest. In fact, individual differences between examinees on total test scores may be minimal. The critical issue for the user of a mastery test is whether a certain criterion score has been achieved.

    Criterion-referenced test

  • 69

    In True Score Model of Measurement and its Alternatives it is referred to as the true score model of measurement; most widely used and accepted model in psychometric literature.

    Classical Test Theory (CTT)

  • 70

    In True Score Model of Measurement and its Alternatives it is a test’s reliability is conceived of as an objective measure of how precisely the test score assesses the domain from which the test draws a sample - - - -

    Domain Sampling Theory

  • 71

    The universe of items that could conceivably measure that behavior, can be thought of as a hypothetical construct: one that shares certain characteristics with (and is measured by) the sample of items that make up the test. - - - -

    Domain of behavior

  • 72

    It is based on the idea that a person’s test scores vary from testing to testing because of variables in the testing situation.

    Generalizability Theory

  • 73

    It is referred to as latent-trait theory or the latent-trait model, a system of assumptions about measurement (including the assumption that a trait being measured by a test is unidimensional) and the extent to which each test item measures the trait.

    Item Response Theory (IRT)

  • 74

    The item response theory (IRT) is classified in to two which are:

    Item Difficulty Index & Item Discrimination Index

  • 75

    A statistic indicating how many testtakers responded correctly to an item

    Item Difficulty Index

  • 76

    Statistic designed to indicate how adequately a test item discriminates between high and low scorers

    Item Discrimination Index

  • 77

    The two different IRT models that handle different types of data - - - -

    dichotomous test items & polytomous test items

  • 78

    Test items or questions that can be answered with only one of two alternative responses, such as true–false, yes–no, or correct–incorrect questions.

    Dichotomous test items

  • 79

    Test items or questions with three or more alternative responses, where only one is scored correct or scored as being consistent with a targeted trait or other construct - - -   -  

    Polytomous test items

  • 80

    An index of the amount of inconsistency or the amount of expected error in an individual's score and it estimates how repeated measures of a person on the same instrument tend to be distributed around their own "true score" it also allows us to quantify the extent to which a test provides accurate scores. The standard deviation of sample scores multiplied by the square root of 1 minus the reliability of the scores

    Standard Error of Measurement (SEM)

  • 81

    ___________________also known as standard error of a score If a test has a large SEM value, the test is an unreliable measure of a psychological construct

    Standard Error of Measurement (SEM)

  • 82

    In order to use the standard error of measurement to estimate the range of the true score, we make an assumption:

    If the individual were to take a large number of equivalent tests, scores on those tests would tend to be normally distributed, with the individual’s true score as the mean.

  • 83

    A student scores 75 on a math test, and the standard error of measurement (SEM) is 3. What can we infer about the student's true score at a 68% confidence level?

    The true score is likely between 72 and 78.

  • 84

    An athlete's performance is measured with a standard error of measurement (SEM) of 2. If the athlete scores 45, what is the range of scores within which we can be 95% confident the true score falls?

    The true score is likely between 43 and 47.

  • 85

    A researcher is conducting a study and measures a participant's response time with a standard error of measurement (SEM) of 1.5. If the participant's response time is 30 milliseconds, what is the range of scores within which we can be 99% confident the true score falls?

    The true score is likely between 27 and 33.

  • 86

    A range or band of test scores that is likely to contain the true score -

    Confidence interval

  • 87

    (Analyze this) if a student achieved a score of 50 on one spelling test and if the test had a standard error of measurement of 4, then—using 50 as the point estimate—we can be: ■ 68% confident that the true score falls within 50 ± 1σmeas (or between 46 and 54, including 46 and 54); ■ 95% confident that the true score falls within 50 ± 2σmeas (or between 42 and 58, including 42 and 58); ■ 99% confident that the true score falls within 50 ± 3σmeas (or between 38 and 62, including 38 and 62).

    NOTED

  • 88

    Which of the following statements about the relationship between Validity and Reliability is true? l -As reliability of the test increases, the highest possible value of the validity coefficient increases. ll -Reliability and validity are partially related and partially independent lll- Reliability is a prerequisite for validity - a measure cannot be valid unless it is reliable

    ALL OF THEM

  • PSYCHOLOGICAL ASSESSMENT PRELIM EXAM

    PSYCHOLOGICAL ASSESSMENT PRELIM EXAM

    GIAN CARLO FIESTA · 60問 · 2年前

    PSYCHOLOGICAL ASSESSMENT PRELIM EXAM

    PSYCHOLOGICAL ASSESSMENT PRELIM EXAM

    60問 • 2年前
    GIAN CARLO FIESTA

    ABNORMAL PSYCHOLOGY PRELIM EXAM

    ABNORMAL PSYCHOLOGY PRELIM EXAM

    GIAN CARLO FIESTA · 75問 · 2年前

    ABNORMAL PSYCHOLOGY PRELIM EXAM

    ABNORMAL PSYCHOLOGY PRELIM EXAM

    75問 • 2年前
    GIAN CARLO FIESTA

    EXAM

    EXAM

    GIAN CARLO FIESTA · 34問 · 1年前

    EXAM

    EXAM

    34問 • 1年前
    GIAN CARLO FIESTA

    DEV PSYCH 1

    DEV PSYCH 1

    GIAN CARLO FIESTA · 30問 · 1年前

    DEV PSYCH 1

    DEV PSYCH 1

    30問 • 1年前
    GIAN CARLO FIESTA

    AB PSYCH 1

    AB PSYCH 1

    GIAN CARLO FIESTA · 62問 · 1年前

    AB PSYCH 1

    AB PSYCH 1

    62問 • 1年前
    GIAN CARLO FIESTA

    PSYCH ASSESSMENT 1

    PSYCH ASSESSMENT 1

    GIAN CARLO FIESTA · 100問 · 1年前

    PSYCH ASSESSMENT 1

    PSYCH ASSESSMENT 1

    100問 • 1年前
    GIAN CARLO FIESTA

    PSYCH ASSESSMENT 2

    PSYCH ASSESSMENT 2

    GIAN CARLO FIESTA · 87問 · 1年前

    PSYCH ASSESSMENT 2

    PSYCH ASSESSMENT 2

    87問 • 1年前
    GIAN CARLO FIESTA

    IO PSYCH 1

    IO PSYCH 1

    GIAN CARLO FIESTA · 100問 · 1年前

    IO PSYCH 1

    IO PSYCH 1

    100問 • 1年前
    GIAN CARLO FIESTA

    IO PSYCH 2

    IO PSYCH 2

    GIAN CARLO FIESTA · 75問 · 1年前

    IO PSYCH 2

    IO PSYCH 2

    75問 • 1年前
    GIAN CARLO FIESTA

    問題一覧

  • 1

    Which of the following criteria are essential for a good test?

    All of the above

  • 2

    Test users often speak of the psychometric soundness of tests, key aspects of which are:

    Fairness

  • 3

    This includes the notion that each individual measurement has an element of error such as observer's error, environmental changes, participant's changes etc.  It is the stability or consistency of the measurement.

    Reliability

  • 4

    A________ is defined as one on which test takers will fall in the same positions relative to each other

    Reliable Test

  • 5

    The extent to which measurements are consistent or repeatable and it is the extent to which measurements differ from occasion to occasion as a function of measurement error.

    Reliable Test

  • 6

    To check for reliability or Test of Reliability /, we use____________

    Coefficient Alpha

  • 7

    Reliability Coefficient should not go beyond_______

    (+/- 1)

  • 8

    The higher the coefficient alpha, __________

    The higher reliability

  • 9

    ______ happen when the test scores gain reliability as the number of items increases the higher the coefficient alpha, the higher the reliability.

    Reliability Coefficient

  • 10

    Theory behind reliability analysis, reliability analyses assume that test scores reflect two factors:

    Stable characteristics of the individual & Chance features of the individual

  • 11

    The true characteristics of the individua

    Stable characteristics of the individual

  • 12

    Random measurement of error

    Chance features of the individual

  • 13

    A value that, according to classical test theory, genuinely reflects an individual’s ability (or trait) level as measured by a particular test

    True score

  • 14

    If you are interested in the truth independent of measurement, you are not looking for the so-called true score, but what psychologists call the________________

    Construct score

  • 15

    The Classical Test Theory (CTT) formula

    X = T + e

  • 16

    X = T + e means...

    X = observed score, T = true score, e = measurement error

  • 17

    ___________________ the value of e should be close to 0 and the value of T should close to the actual test score x. Simply, most of the test scores should result from the measurement of true characteristics T.

    Using the classical test theory (CTT) in a reliable test

  • 18

    Proportion of test score reflecting person's true characteristics

    T/X

  • 19

    Proportion of test score reflecting random error

    X/T

  • 20

    Refers to the inherent uncertainty associated with any measurement, even after care has been taken to minimize preventable mistakes. Estimates of a quantity differ each time a measurement is taken—if only slightly. These fluctuations in measurement occur even when procedures are followed perfectly, and no obvious mistakes are made.

    Measurement error

  • 21

    Variance from true differences is________________

    True variance

  • 22

    Variance from irrelevant, random sources is__________

    Error variance

  • 23

    We can indirectly estimate how much the true score influences the observed score by___________

    Measuring the variability of test scores

  • 24

    The term reliability refers to the proportion of the total variance attributed to true variance. The greater the proportion of the total variance attributed to true variance, the more reliable the test.

    TRUE

  • 25

    Measurement error can be___________or___________

    systematic or random

  • 26

    A source of error in measuring a targeted variable, caused by unpredictable fluctuations and inconsistencies of other variables in the measurement process

    Random error

  • 27

    A source of error in measuring a variable that is typically constant and proportionate to what is presumed to be the true value of the variable being measured

    Systematic error

  • 28

    Test construction , Test administration, Test scoring and interpretation, Other sources of error are the ________

    Sources of error variance

  • 29

    A type of sources of error variance that has a item sampling/content sampling and the variety of the subject matter contained in the items; frequently referred to in the context of the variation between individual test items in a test or between test items in two or more tests.

    Test construction

  • 30

    Sources of error variance that occur during test administration may influence the testtaker’s attention or motivation.

    Test administration

  • 31

    Source of error variance that occur during test due to the factor of room temperature, level of lighting, and amount of ventilation and noise etc. - - - -

    Test environment

  • 32

    Source of error variance that occur during test due to the factor of pressing emotional problems, physical discomfort, lack of sleep, casual life experiences, illness, and changes in mood or mental state.

    test taker variables

  • 33

    Source of error variance that occur during test due to the factor of examiner’s physical appearance and demeanor— even the presence or absence of an examiner - - - -

    examiner-related variables

  • 34

    A type of sources of error variance that the advent of computer scoring and a growing reliance on objective, computer-scorable items have virtually eliminated error variance caused by scorer differences.

    Test scoring and interpretation

  • 35

    If subjectivity is involved in scoring, then the scorer (or rater)____________

    can be a source of error variance.

  • 36

    Reliability analysis correlates performance on two interval scale measures and uses that correlation to indicate the extent of true score differences by using some methods and these are:

    Test-retest method, Parallel/Alternate forms method, Split-half method, Inter-item consistency, and  Inter-scorer Reliability.

  • 37

    In measuring reliability____________reliability estimates are used to evaluate the error associated with administering a test at two different times. This type of analysis is of value when we measure "traits" or characteristics that do not change over time.

    Test-retest method

  • 38

    Test-retest method ideally, 6 months or more interval (when the interval between testing is greater than six months, estimates are called___________

    Coefficient of Stability

  • 39

    In measuring reliability____________reliability compares two equivalent forms of a test that measure the same attribute, the two forms use different items; however, the rules used to select items of a particular difficulty level are the same.

    Parallel / Alternate Forms method reliability

  • 40

    In measuring reliability____________, a test is given some and divided into halves that are scored separately and the results of one half of the test are then compared with the results of the other. Using the odd-even system, whereby one subscore is obtained for the odd number items in the test and another for the even-number items.

    Split-half method reliability

  • 41

    In Split-half method reliability we used__________ to allows us to estimate what the correlation between the two halves would have been if each half had been the length of the whole test.

    Spearman Brown formula

  • 42

    In measuring the reliability we used___________to know the degree of correlation among all the items on a scale, calculated using a single administration of a single test form and it is useful in assessing the homogeneity of the test.

    Inter-item Consistency

  • 43

    Methods in assessing internal consistency or Inter-item Consistency l Cronbach alpha/Coefficient Alpha ll R-20 lll KR-21

    all of the above

  • 44

    When measuring reliability using Inter-item Consistency_______is used to estimates the internal consistency of test in which the items are non-dichotomous or there is no right or wrong answer. Typically range from 0 to 1 (negative values are theoretically impossible)

    Coefficient alpha

  • 45

    It developed a more general reliability estimate which he called coefficient alpha

    Cronbach

  • 46

    When measuring reliability using Inter-item Consistency____________is used to calculates for the internal consistency of tests with dichotomous items with varying difficulty.

    Kuder-Richardson-20

  • 47

    When measuring reliability using Inter-item Consistency____________is used to calculates for internal consistency of tests with dichotomous items with equal/same difficulty - - - -

    Kuder-Richardson-21

  • 48

    In measuring reliability_____________ is a degree of agreement or consistency between two or more scorers (or judges, or raters) with regard to a particular measure. The judges rate the answers of the examinee and often used for creativity or projective tests.

    Inter-scorer reliability

  • 49

    Method: Test-Retest  NOF___NOS______Sources of Error/Variance_____

    1 - 2 - Changes over time

  • 50

    Method: Alternate Forms (Immediate) NOF___NOS______Sources of Error/Variance_____ - - - -

    2 - 1 - Item Sampling

  • 51

    Method: Alternate Forms (Delayed) NOF___NOS______Sources of Error/Variance_____

    2 - 2- Item sampling & Changes over time

  • 52

    Method: Split-Half NOF___NOS______Sources of Error/Variance_____

    1 - 1 - Item sample & Nature of split

  • 53

    Method: Coefficient alpha, KR NOF___NOS______Sources of Error/Variance_____ - - - -

    1 - 1 - Item sampling, Test Heterogeneity

  • 54

    Method: Inter-rater NOF___NOS______Sources of Error/Variance_____ - - - -

    1 - 1 - Scorer Differences

  • 55

    For critical decisions, such as medical diagnoses or hiring selections, what is the minimum acceptable coefficient of reliability.

    Coefficient of .95 or higher (Grade A)

  • 56

    In educational settings, what is the recommended range for the coefficient of reliability to ensure a reasonable level of consistency in test scores? - - - -

    Coefficient in the .80s (Grade B)

  • 57

    For non-critical decisions, such as general research studies or surveys, what is the lowest acceptable range for the coefficient of reliability?

    Coefficient in the .65 to .70 range (Weak, "barely passing" grade)

  • 58

    The nature of the test where items are functionally uniform throughout.

    Homogeneous

  • 59

    The nature of the test where an estimate of internal consistency might be low relative to a more appropriate estimate of test-retest reliability

    Heterogeneous

  • 60

    As long as the items are positively correlated, adding many items eventually results in high internal consistency coefficients, homogeneous or not.

    True

  • 61

    The nature of the test where a trait, state, or ability presumed to be ever-changing as a function of situational and cognitive experiences .

    Dynamic Characteristics

  • 62

    The nature of the test where a trait, state, or ability presumed to be relatively unchanging (such as intelligence)

    Static characteristic

  • 63

    A situation in nature of the test when the variance of either variable in a correlational analysis is restricted by the sampling procedure used, then the resulting correlation coefficient tends to be lower.

    Restriction or inflation of range (restriction/inflation of variance)

  • 64

    If the variance of either variable in a correlational analysis is inflated by the sampling procedure, then the resulting correlation coefficient tends to be lower.

    False

  • 65

    It is generally contains items of uniform level of difficulty (typically uniformly low) so that, when given generous time limits, all testtakers should be able to complete all the test items correctly.

    Speed test

  • 66

    When a time limit is long enough to allow testtakers to attempt all items, and if some items are so difficult that no testtaker is able to obtain a perfect score

    Power test

  • 67

    Which of the following is not a method for estimating the reliability of a speed test based on performance from two independent testing periods?

    Split-quarter reliability

  • 68

    It is designed to provide an indication of where a testtaker stands with respect to some variable or criterion, such as an educational or a vocational objective. This nature of the test wants to know  how different the scores are from one another is seldom a focus of interest. In fact, individual differences between examinees on total test scores may be minimal. The critical issue for the user of a mastery test is whether a certain criterion score has been achieved.

    Criterion-referenced test

  • 69

    In True Score Model of Measurement and its Alternatives it is referred to as the true score model of measurement; most widely used and accepted model in psychometric literature.

    Classical Test Theory (CTT)

  • 70

    In True Score Model of Measurement and its Alternatives it is a test’s reliability is conceived of as an objective measure of how precisely the test score assesses the domain from which the test draws a sample - - - -

    Domain Sampling Theory

  • 71

    The universe of items that could conceivably measure that behavior, can be thought of as a hypothetical construct: one that shares certain characteristics with (and is measured by) the sample of items that make up the test. - - - -

    Domain of behavior

  • 72

    It is based on the idea that a person’s test scores vary from testing to testing because of variables in the testing situation.

    Generalizability Theory

  • 73

    It is referred to as latent-trait theory or the latent-trait model, a system of assumptions about measurement (including the assumption that a trait being measured by a test is unidimensional) and the extent to which each test item measures the trait.

    Item Response Theory (IRT)

  • 74

    The item response theory (IRT) is classified in to two which are:

    Item Difficulty Index & Item Discrimination Index

  • 75

    A statistic indicating how many testtakers responded correctly to an item

    Item Difficulty Index

  • 76

    Statistic designed to indicate how adequately a test item discriminates between high and low scorers

    Item Discrimination Index

  • 77

    The two different IRT models that handle different types of data - - - -

    dichotomous test items & polytomous test items

  • 78

    Test items or questions that can be answered with only one of two alternative responses, such as true–false, yes–no, or correct–incorrect questions.

    Dichotomous test items

  • 79

    Test items or questions with three or more alternative responses, where only one is scored correct or scored as being consistent with a targeted trait or other construct - - -   -  

    Polytomous test items

  • 80

    An index of the amount of inconsistency or the amount of expected error in an individual's score and it estimates how repeated measures of a person on the same instrument tend to be distributed around their own "true score" it also allows us to quantify the extent to which a test provides accurate scores. The standard deviation of sample scores multiplied by the square root of 1 minus the reliability of the scores

    Standard Error of Measurement (SEM)

  • 81

    ___________________also known as standard error of a score If a test has a large SEM value, the test is an unreliable measure of a psychological construct

    Standard Error of Measurement (SEM)

  • 82

    In order to use the standard error of measurement to estimate the range of the true score, we make an assumption:

    If the individual were to take a large number of equivalent tests, scores on those tests would tend to be normally distributed, with the individual’s true score as the mean.

  • 83

    A student scores 75 on a math test, and the standard error of measurement (SEM) is 3. What can we infer about the student's true score at a 68% confidence level?

    The true score is likely between 72 and 78.

  • 84

    An athlete's performance is measured with a standard error of measurement (SEM) of 2. If the athlete scores 45, what is the range of scores within which we can be 95% confident the true score falls?

    The true score is likely between 43 and 47.

  • 85

    A researcher is conducting a study and measures a participant's response time with a standard error of measurement (SEM) of 1.5. If the participant's response time is 30 milliseconds, what is the range of scores within which we can be 99% confident the true score falls?

    The true score is likely between 27 and 33.

  • 86

    A range or band of test scores that is likely to contain the true score -

    Confidence interval

  • 87

    (Analyze this) if a student achieved a score of 50 on one spelling test and if the test had a standard error of measurement of 4, then—using 50 as the point estimate—we can be: ■ 68% confident that the true score falls within 50 ± 1σmeas (or between 46 and 54, including 46 and 54); ■ 95% confident that the true score falls within 50 ± 2σmeas (or between 42 and 58, including 42 and 58); ■ 99% confident that the true score falls within 50 ± 3σmeas (or between 38 and 62, including 38 and 62).

    NOTED

  • 88

    Which of the following statements about the relationship between Validity and Reliability is true? l -As reliability of the test increases, the highest possible value of the validity coefficient increases. ll -Reliability and validity are partially related and partially independent lll- Reliability is a prerequisite for validity - a measure cannot be valid unless it is reliable

    ALL OF THEM