VIII. Vocabulary 


Alternate Forms Reliability - A method of reliability that determines the degree to which two forms (such as Form A and Form B) of the same test are consistent. Researchers administer both forms to a sample of participants and correlate the scores.

Concurrent Validity - A form of criterion-related validity that refers to the degree that the actual test results are corresponding to the expected results. 

Content Validity -This type of validity refers to the competence of a test to measure the behavior that it initially was designed to measure.

Convergent Validation - An assessment highly correlates with another assessment that measures the same concept.

Construct Validity -This type of validity refers to the adequacy of the operational definition of the test in relation to the hypothesis.  In other words, the design and procedures of a test should be applicable to the original intended hypothesis.

Correlation - The simultaneous change in value of two numerically valued random variables. Correlation Coefficient = r = a value between .00 (no relationship) and 1.00 (a perfect relationship).

Criterion-Related Validity - Also known as instrumental validity, is used to demonstrate the accuracy of a measure or procedure by comparing it with another measure or procedure, which has already been demonstrated valid (a standard or criterion). There are two types: Concurrent validity and Predictive validity.

Discriminant Validation - As assessment does not highly correlate with another assessment that measures an unrelated concept.

Factor Analysis - A statistical technique used to simplify the number of components in a set of data. A researcher names the simplified components accordingly to their characteristics. 

Face Validity - This is NOT true validity.  It is a judgment as to whether the "appearance" of the test looks official to the person taking it.

Frequency Distribution - The systematic arrangement of scores on a measure to reflect how frequently each value on the measure occurred

Inter-Scorer Reliability - The degree of agreement between two or more scorers, judgers, or raters. 

Internal Consistency - The extent to which all items on a test measure the variable or construct.

Mean - The arithmetic average of the scores (Population Mean = µ; Sample Mean =  ̅x)

Mode - Most frequently occurring score

Median - Exact middle value of a distribution of scores arranging the scores from least to greatest.

Normal Distribution Curve - A symmetrical, mathematically defined, bell-shaped frequency distribution

Predictive Validity - How well scores on an assessment reliability predict future performance of the concept that was measured.

Range - The spread between the highest and lowest scores.

Reliability - The extent to which a test or measuring tool is consistent and replicable in its measurements. 

Standardization - Administrating carefully constructed tests to a large, representative sample of people under standard conditions for the purpose of determining norms.

Standard Deviation - A measure of spread within a distribution (the square root of the variance). The most popular and most reliable measure of variability. However, the more skewed the distribution, the more error there will be in the standard deviation because of its reliance of the mean. Standard Deviation = σ; Sample Standard Deviation = Sx

Split-Half Reliability - Is an internal consistency approach in which the scores from a single test are divided into two equal parts and correlated.

Test-Retest Reliability - A method of establishing a correlation coefficient is determined by comparing the scores of the same measuring device administered to the same people on two different occasions. Comparing test results over time allows the test developer to see how stable the test is over time.

Validity - The extent to which a test or measuring tool accurately measures what it was intended to measure.


Return