GMAT Focus Edition Verbal: Evaluate questions need information that swings BOTH ways. The two-direction test is your primary tool.
Home Course Verbal Reasoning Lesson 12
Verbal Theory • Lesson 12 of 20

CR Evaluate &
Two-Direction Analysis

Find the information that cuts both ways. The correct choice can strengthen OR weaken the argument depending on whether the answer is yes or no.

Time: 55 mins
Target: V78 to V90
Prerequisites: Lessons 1–4 (CR fundamentals)
Course Verbal Reasoning Lesson 12
1

Core Philosophy: Information That Could Cut Either Way

Evaluate questions ask you to find the piece of information that would be MOST USEFUL in evaluating the argument — meaning the information whose answer could either strengthen OR weaken the argument depending on what the answer turns out to be.

The defining feature of a correct evaluate answer is its two-directional potential: if the answer to the question in the choice is "yes," the argument is strengthened; if "no," it is weakened (or vice versa). This distinguishes evaluate answers from pure strengtheners or weakeners.

Core Insight: The correct evaluate answer must swing the argument in BOTH directions. If an answer choice can only strengthen or only weaken, it is wrong.

2

What Makes an Evaluate Answer Correct

The Two-Direction Test
Answer to evaluate choice = YES
→ Argument is strengthened (or conclusion becomes more likely)
Answer to evaluate choice = NO
→ Argument is weakened (or conclusion becomes less likely)
⚠ If the answer is "yes" → strengthens AND "no" → also strengthens (or both weaken), the choice is wrong — it can only move in one direction.
3

The Two-Direction Test Strategy

01

Identify the argument's core assumption

The most useful information is almost always related to the argument's central assumption or causal bridge.

02

For each choice, answer "yes" and "no"

Apply the two-direction test: if yes → what happens to the argument? If no → what happens? The correct choice must affect the argument differently depending on the answer.

03

Eliminate one-directional choices

If yes and no both strengthen (or both weaken) the argument, the choice does not evaluate — it only confirms or denies.

04

Select the most pivotal choice

The correct choice addresses the most central question in the argument — the answer with the greatest ability to change your assessment.

4

Worked Examples

Example 1

Argument: "After the city opened a new subway line, traffic on the parallel highway decreased by 15%. The subway is responsible for the traffic reduction."

Best evaluate choice: "Did highway traffic in comparable cities without new subway lines also decrease during the same period?"
If YES: Traffic declined everywhere → subway may not be the cause → weakens.
If NO: Traffic declined only in this city → subway likely contributed → strengthens.
This choice can cut either way → correct evaluate answer.
Example 2 — Wrong Direction

Same argument above. Consider: "Whether the subway line serves popular destinations."

If YES: Subway is well-used → slightly supports the argument.
If NO: Subway is unused → weakens even harder.
Both answers could weaken (neither strengthens cleanly). But more importantly, the choice doesn't directly address the alternative-cause issue. Less powerful than the correct choice.
5

10 Evaluate Question Traps

1. One-directional choice

A choice that always strengthens (or always weakens) regardless of the answer is not an evaluate choice.

2. Out-of-scope question

A choice asking about an issue tangential to the argument's core assumption.

3. Already-answered question

A choice whose answer is already provided in the stimulus.

4. True-either-way trap

The correct answer must be uncertain — if the answer is obviously yes or no, it fails to evaluate.

5. Overly specific question

A choice that asks about a minor detail rather than the pivotal assumption.

6. Mechanism vs evaluation

Explaining how something works doesn't evaluate whether the argument's conclusion follows.

7. General vs specific relevance

A choice that would evaluate all arguments of this type, not specifically this one, is too general.

8. Counter-example trap

A choice that could only produce a counter-example, not genuine two-directional evaluation.

9. Loaded question trap

A choice that assumes something not in the stimulus as its premise.

10. Statistical precision trap

Asking for exact statistics when the argument would not be changed by the precision.

6

Evaluate vs Strengthen/Weaken — The Critical Distinction

FeatureEvaluateStrengthen/Weaken
Answer formA question whose answer could go either wayA statement that moves in one direction
Test to applyYes → strengthen AND no → weaken (or vice versa)Does this make the conclusion more/less likely?
Correct answer looks likeA question about the key assumption or alternative causeNew information relevant to the argument's gap
Wrong answer patternChoices that only go one directionChoices that are out of scope or neutral
9

10 GMAT-Style Practice Questions

Select your answer, then reveal the step-by-step explanation. Each question reflects real GMAT difficulty and format.

Question 1 of 10 GMAT Verbal

A company replaced its existing customer service team with an AI chatbot. Customer satisfaction scores subsequently rose by 22%. Management concluded that the AI chatbot was responsible for the improvement. Which of the following would be MOST useful in evaluating the management's conclusion?

Correct Answer: (B)
(B) is correct. Apply the two-direction test: If YES (other companies also saw 22% improvement) → external trends explain it, not the AI → weakens. If NO (other companies saw no improvement) → the AI is likely the cause → strengthens. (A) is about vendor source — doesn't evaluate causation. (C) is about capability — doesn't evaluate whether it caused the improvement. (D) is about measurement consistency — addresses a different issue. (E) is about cost — doesn't evaluate causation.
Question 2 of 10 GMAT Verbal

A nutrition researcher argues that daily consumption of green tea reduces the risk of developing type 2 diabetes, citing a study in which regular green tea drinkers had a 28% lower incidence of diabetes over ten years. Which of the following would be most useful in evaluating this argument?

Correct Answer: (B)
(B) is correct. Two-direction test: If YES (green tea drinkers also exercised more and ate better) → healthier lifestyle explains the lower diabetes rate, not the green tea → weakens. If NO (tea drinkers and non-drinkers had similar diet/exercise habits) → green tea likely contributed → strengthens. (A) is a mechanism — provides theoretical support but doesn't evaluate the study's causal conclusion. (C) affects generalizability, not the core causal claim. (D) is about statistical significance — relevant but less pivotal than confounders.
Question 3 of 10 GMAT Verbal

An airline concluded that a new boarding process, which reversed the traditional back-to-front boarding order, would reduce average boarding time by 10 minutes per flight. Which of the following would be most useful in evaluating this conclusion?

Correct Answer: (B)
(B) is correct. Two-direction test: If YES (the process worked consistently across different flight types) → the 10-minute reduction is likely to hold in practice → strengthens. If NO (only worked on specific aircraft configurations) → the claim may not generalize → weakens. (A) is about passenger preference, not time savings. (C) is about correlation between boarding style and punctuality, which doesn't directly test the 10-minute claim. (D) asks about the impact of the result, not the validity of the claim. (E) is about training costs, not the time reduction claim.
Question 4 of 10 GMAT Verbal

A city installed speed cameras on all major roads and subsequently reported a 30% decrease in traffic fatalities. The city council concluded that the speed cameras caused the reduction in fatalities. Which of the following would be most useful in evaluating the council's conclusion?

Correct Answer: (B)
(B) is correct. If YES (comparable cities also saw ~30% reduction) → fatalities declined everywhere → cameras may not be the cause → weakens. If NO (other cities saw no improvement) → cameras likely contributed → strengthens. (A) describes placement strategy — doesn't evaluate causation. (C) is about enforcement mechanism — doesn't directly test the causal claim. (D) asks about the measurement type — could be relevant but less directly tied to the causal question. (E) introduces concurrent measures — a relevant confound. (B) is broader and more directly evaluates the causal claim.
Question 5 of 10 GMAT Verbal

A company introduced a four-day workweek and found that employee productivity, measured by output per employee per week, remained unchanged. Management concluded that a four-day workweek is just as productive as a five-day workweek. Which of the following would be most useful in evaluating this conclusion?

Correct Answer: (B)
(B) is correct. Two-direction test: If YES (the productivity measure did capture all task types) → the unchanged result is valid → strengthens. If NO (some collaborative tasks were harder in four days) → productivity may have actually declined in ways the measure missed → weakens. (A) is about preference, not productivity. (C) is about industry trends — doesn't evaluate the productivity claim. (D) absenteeism is an adjacent metric, not productivity. (E) is about competitors — doesn't evaluate this company's specific claim.
Question 6 of 10 GMAT Verbal

A school district implemented a new reading curriculum and reported that average reading test scores rose by 15 points over two years. The superintendent concluded that the new curriculum caused the improvement. Which of the following would be most useful in evaluating the superintendent's conclusion?

Correct Answer: (A)
(A) is correct. If YES (other districts also improved by ~15 points) → improvement might be from broader factors (better test prep, teacher experience) → weakens. If NO (other districts didn't improve) → the new curriculum likely caused the gain → strengthens. (B) about training → a potential strengthener but not two-directional enough. (C) about test bias → relevant but less central than the alternative-cause question. (D) about what the score means, not causation. (E) about parent behavior → introduces a confounder but doesn't directly test the causal claim as cleanly.
Question 7 of 10 GMAT Verbal

A tech company launched a mentorship program pairing junior engineers with senior engineers. Six months later, junior engineer retention improved from 70% to 85%. The HR director concluded that the mentorship program caused the retention improvement. Which of the following would be most useful in evaluating the HR director's conclusion?

Correct Answer: (B)
(B) is correct. This addresses the central alternative-cause question: was this a market-wide trend (job market tightened, making all engineers less likely to leave)? Yes → weakens. No → strengthens. (A) is about program structure — could affect results but is a weaker evaluation. (C) is about costs to senior engineers — doesn't evaluate causation. (D) is about statistical significance — relevant but less central than the alternative-cause question. (E) is about promotions — an adjacent outcome, not causation.
Question 8 of 10 GMAT Verbal

A hospital introduced mandatory daily team briefings for medical staff and reported a 25% reduction in medical errors over the subsequent year. Administration concluded that the briefings caused the error reduction. Which of the following would be most useful in evaluating the administration's conclusion?

Correct Answer: (A)
(A) is correct. Yes → other hospitals also improved → briefings may not be the cause → weakens. No → only this hospital improved → briefings likely contributed → strengthens. (E) also identifies a potential confounder (other initiatives) and is a strong choice, but (A) tests the external benchmark more directly. (B) about timing. (C) about staff perception. (D) about where errors decreased — relevant to generalizability but not causation.
Question 9 of 10 GMAT Verbal

A retail chain began playing classical music in all its stores and found that average transaction value (amount spent per customer visit) increased by 12% over the following quarter. Marketing management concluded that the classical music caused customers to spend more. Which of the following would be most useful in evaluating this conclusion?

Correct Answer: (B)
(B) is correct. Two-direction test: If YES (a new product line explains the higher transaction values) → music is not the cause → weakens significantly. If NO (no new product lines; only music changed) → music is more likely the cause → strengthens. (A) is about awareness — relevant to mechanism but not causal attribution. (C) is about preference — doesn't address causation. (D) is about downstream profitability — doesn't address whether music caused the spending change. (E) is about volume consistency — a minor implementation detail.
Question 10 of 10 GMAT Verbal

An insurance company reduced its claims processing time from 14 days to 6 days after implementing a new software platform. Management concluded that the software caused the improvement. Which of the following would be most useful in evaluating management's conclusion?

Correct Answer: (B)
(B) is correct. If YES (more staff were hired concurrently) → additional personnel could explain the faster processing → weakens. If NO (staff count unchanged) → software is more likely the cause → strengthens. (A) about vendor reputation — doesn't evaluate whether software was the cause. (C) about satisfaction — downstream outcome, not causation. (D) about industry benchmarks — doesn't test causation. (E) about industry adoption — doesn't address whether this company's implementation caused the improvement.
10

Key Takeaways

1. The two-direction test is mandatory

Apply yes/no to every choice. The correct answer affects the argument differently depending on the response.

2. Target the central assumption

The most useful evaluating information addresses the argument's core causal bridge or assumption.

3. One-directional choices are wrong

If yes and no both strengthen (or both weaken), the choice cannot evaluate — eliminate it.

4. Alternative causes are pivotal

Questions asking whether an alternative explanation exists are among the strongest evaluate choices.

Lesson 11 All Verbal Lessons Lesson 13