This chapter is from the book
This chapter is from the book
SQL: Visual QuickStart Guide, 3rd Edition
Learn More Buy
This chapter is from the book
This chapter is from the book
SQL: Visual QuickStart Guide, 3rd Edition
Learn More Buy
Combining and Negating Conditions with AND, OR, and NOT
You can specify multiple conditions in a single WHERE clause to, say, retrieve rows based on the values in multiple columns. You can use the AND and OR operators to combine two or more conditions into a compound condition. AND, OR, and a third operator, NOT, are logical operators. Logical operators, or Boolean operators, are operators designed to work with truth values: true, false, and unknown.
If you’ve programmed in other languages (or studied propositional logic), you’re familiar with the two-value logic (2VL) system. In two-value logic, the result of a logical expression is either true or false. 2VL assumes perfect knowledge, in which all propositions are known to be true or false. Databases model real data, however, and our knowledge of the world is imperfect—that’s why we use nulls to represent unknown values (see “Nulls” in Chapter 3).
2VL is insufficient to represent knowledge gaps, so SQL uses three-value logic (3VL). In three-value logic, the result of a logical expression is true, false, or unknown. If the result of a compound condition is false or unknown, the row is excluded from the result. (To retrieve rows with nulls, see “Testing for Nulls with IS NULL” later in this chapter.)
The AND operator
The AND operator’s important characteristics are:
- AND connects two conditions and returns true only if both conditions are true.
- Table 4.3 shows the possible outcomes when you combine two conditions with AND. The table’s left column shows the truth values of the first condition, the top row shows the truth values of the second condition, and each intersection shows the AND outcome. This type of table is called a truth table.
Table 4.3.
AND
True
False
Unknown
True
True
False
Unknown
False
False
False
False
Unknown
Unknown
False
Unknown
- Any number of conditions can be connected with ANDs. All the conditions must be true for the row to be included in the result.
- AND is commutative (independent of order): WHERE condition1 AND condition2 is equivalent to WHERE condition2 AND condition1.
- You can enclose one or both of the conditions in parentheses. Some compound conditions need parentheses to force the order in which conditions are evaluated.
See Listings 4.22 and 4.23, and Figures 4.22 and 4.23, for some AND examples.
Listing 4.22. List the biographies that sell for less than $20. See Figure 4.22 for the result.
SELECT title_name, type, price FROM titles WHERE type = 'biography' AND price < 20;
Figure 4.22 Result of Listing 4.22.
title_name type price------------------------- ---------- -----How About Never? biography 19.95Spontaneous, Not Annoying biography 12.99
Listing 4.23. List the authors whose last names begin with one of the letters H through Z and who don’t live in California. See Figure 4.23 for the result.
SELECT au_fname, au_lname FROM authors WHERE au_lname >= 'H' AND au_lname <= 'Zz' AND state <> 'CA';
Figure 4.23 Result of Listing 4.23. Remember that the results of string comparisons depend on the DBMS’s collating sequence; see “Sorting Rows with ORDER BY” earlier in this chapter.
au_fname au_lname--------- -----------Wendy HeydemarkChristian KellsPaddy O'Furniture
The OR operator
The OR operator’s important characteristics are:
- OR connects two conditions and returns true if either condition is true or if both conditions are true.
Table 4.4 shows the OR truth table.
Table 4.4.
OR
True
False
Unknown
True
True
True
True
False
True
False
Unknown
Unknown
True
Unknown
Unknown
- Any number of conditions can be connected with ORs. OR will retrieve rows that match any condition or all the conditions.
- Like AND, OR is commutative; the order in which you list the conditions doesn’t matter.
- You can enclose one or both of the conditions in parentheses.
See Listings 4.24 and 4.25, and Figures 4.24 and 4.25, for some OR examples.
Listing 4.24. List the authors who live in New York State, Colorado, or San Francisco. See Figure 4.24 for the result.
SELECT au_fname, au_lname, city, state FROM authors WHERE (state = 'NY') OR (state = 'CO') OR (city = 'San Francisco');
Figure 4.24 Result of Listing 4.24.
au_fname au_lname city state--------- --------- --------------- -----Sarah Buchman Bronx NYWendy Heydemark Boulder COHallie Hull San Francisco CAKlee Hull San Francisco CAChristian Kells New York NY
Listing 4.25. List the publishers that are located in California or are not located in California. This example is contrived to show the effect of nulls in conditions; see Figure 4.25 for the result.
SELECT pub_id, pub_name, state, country FROM publishers WHERE (state = 'CA') OR (state <> 'CA');
Figure 4.25 Result of Listing 4.25. Publisher P03 is missing because its state is null.
pub_id pub_name state country------ ----------------- ----- -------P01 Abatis Publishers NY USAP02 Core Dump Books CA USAP04 Tenterhooks Press CA USA
Listing 4.25 shows the effect of nulls in conditions. You might expect the result, Figure 4.25, to display all the rows in the table publishers. But the row for publisher P03 (located in Germany) is missing because it contains a null in the column state. The null causes the result of both of the OR conditions to be unknown, so the row is excluded from the result. To test for nulls, see “Testing for Nulls with IS NULL” later in this chapter.
The NOT operator
The NOT operator’s important characteristics are:
- Unlike AND and OR, NOT doesn’t connect two conditions. Instead, it negates (reverses) a single condition.
Table 4.5 shows the NOT truth table.
Table 4.5.
CONDITION
NOT CONDITION
True
False
False
True
Unknown
Unknown
In comparisons, place NOT before the column name or expression
WHERE NOT state = 'CA' --Correct
and not before the operator (even though it sounds better when read):
WHERE state NOT = 'CA' --Illegal
NOT acts on one condition. To negate two or more conditions, repeat the NOT for each condition. To list titles that are not biographies and are not priced less than $20, for example, type
SELECT title_id, type, price FROM titles WHERE NOT type = 'biography' AND NOT price < 20; --Correct
and not
SELECT title_id, type, price FROM titles WHERE NOT type = 'biography' AND price < 20; --Wrong
The latter clause is legal but returns the wrong result. See the Tips in this section to learn ways to express equivalent NOT conditions.
In comparisons, using NOT often is a matter of style. The following two clauses are equivalent:
WHERE NOT state = 'CA'
and
WHERE state <> 'CA'
- You can enclose the condition in parentheses.
Listing 4.26. List the authors who don’t live in California. See Figure 4.26 for the result.
SELECT au_fname, au_lname, state FROM authors WHERE NOT (state = 'CA');
Figure 4.26 Result of Listing 4.26.
au_fname au_lname state--------- ----------- -----Sarah Buchman NYWendy Heydemark COChristian Kells NYPaddy O'Furniture FL
Listing 4.27. List the titles whose price is not less than $20 and that have sold more than 15,000 copies. See Figure 4.27 for the result.
SELECT title_name, sales, price FROM titles WHERE NOT (price < 20) AND (sales > 15000);
Figure 4.27 Result of Listing 4.27.
title_name sales price----------------------------- ------- -----Ask Your System Administrator 25667 39.95I Blame My Mother 1500200 23.95
Using AND, OR, and NOT together
You can combine the three logical operators in a compound condition. Your DBMS uses SQL’s precedence rules to determine which operators to evaluate first. Precedence is covered in “Determining the Order of Evaluation” in Chapter 5, but for now you need know only that when you use multiple logical operators in a compound condition, NOT is evaluated first, then AND, and finally OR. You can override this order with parentheses: Everything in parentheses is evaluated first. When parenthesized conditions are nested, the innermost condition is evaluated first. Under the default precedence rules, the condition x AND NOT y OR z is equivalent to (x AND (NOT y)) OR z. It’s wise to use parentheses, rather than rely on the default evaluation order, to make the evaluation order clear.
If I want to list history and biography titles priced less than $20, for example, Listing 4.28 won’t work. AND is evaluated before OR, so the query is evaluated as follows:
- Find all the biography titles less than $20.
- Find all the history titles (regardless of price).
- List both sets of titles in the result (Figure 4.28).
Listing 4.28. This query won’t work if I want to list history and biography titles less than $20, because AND has higher precedence than OR. See Figure 4.28 for the result.
SELECT title_id, type, price FROM titles WHERE type = 'history' OR type = 'biography' AND price < 20;
Figure 4.28 Result of Listing 4.28. This result contains two history titles priced more than $20, which is not what I wanted.
title_id type price-------- --------- -----T01 history 21.99T02 history 19.95T06 biography 19.95T12 biography 12.99T13 history 29.99
To fix this query, I’ll add parentheses to force evaluation of OR first. Listing 4.29 is evaluated as follows:
- Find all the biography and history titles.
- Of the titles found in step 1, keep the ones priced less than $20.
- List the subset of titles in the result (Figure 4.29).
Listing 4.29. To fix Listing 4.28, I’ve added parentheses to force OR to be evaluated before AND. See Figure 4.29 for the result.
SELECT title_id, type, price FROM titles WHERE (type = 'history' OR type = 'biography') AND price < 20;
Figure 4.29 Result of Listing 4.29. Fixed.
title_id type price-------- --------- -----T02 history 19.95T06 biography 19.95T12 biography 12.99
Dissecting WHERE Clauses
If your WHERE clause isn’t working, you can debug it by displaying the result of each condition individually. To see the result of each comparison in Listing 4.29, for example, put each comparison expression in the SELECT clause’s output column list, along with the values you’re comparing:
SELECT type, type = 'history' AS "Hist?", type = 'biography' AS "Bio?", price, price < 20 AS "<20?" FROM titles;
This query runs on Microsoft Access, MySQL, and PostgreSQL. If your DBMS interprets the = symbol as an assignment operator rather than as a comparison operator, you must substitute equivalent expressions for the logical comparisons. In Oracle, for example, you can replace type = 'history' with INSTR(type,'history'). The query’s result is:
type Hist? Bio? price <20? --------- ----- --- ----- --- history 1 0 21.99 0 history 1 0 19.95 1 computer 0 0 39.95 0 psychology 0 0 12.99 1 psychology 0 0 6.95 1 biography 0 1 19.95 1 biography 0 1 23.95 0 children 0 0 10.00 1 children 0 0 13.95 1 biography 0 1 NULL NULL psychology 0 0 7.99 1 biography 0 1 12.99 1 history 1 0 29.99 0
The comparison columns display zero if the comparison is false, nonzero if it’s true, or null if it’s unknown.
Tips
- The examples in this section show the AND, OR, and NOT operators used with comparison conditions, but these operators can be used with any type of condition.
If your search condition contains only AND operators, your query will run faster if you put the conditions least likely to be true first. If col1='A' is less likely than col2='B' then
WHERE col1='A' AND col2='B'
is faster than
WHERE col2='B' AND col1='A'
because the DBMS won’t bother to evaluate the second expression if the first is false. For search conditions that contain only OR operators, do the reverse: Put the most likely conditions first. If the conditions are equally likely, put the least complex expression first.
This logic depends on your DBMS’s optimizer reading WHERE clauses from left to right, which most do. Oracle’s cost-based optimizer (as opposed to its rule-based optimizer), however, reads right to left.
It’s a common error to type
WHERE state = 'NY' OR 'CA' --Illegal
instead of
WHERE state = 'NY' OR state = 'CA'
It’s easy to translate a correctly phrased spoken-language statement into an incorrect SQL statement. If you say, “List the books priced less than $10 and more than $30,” the and suggests the use of the AND operator:
SELECT title_name, price FROM titles WHERE price<10 AND price>30; --Wrong
This query returns no rows, however, because it’s impossible for a book to be priced less than $10 and more than $30 simultaneously, as AND logic commands. The logical meaning of OR finds books that meet any of the criteria, not all the criteria at the same time:
WHERE price<10 OR price>30 --Correct
Table 4.6 demonstrates alternative ways of expressing the same condition. The first equivalency is double negation, the second two are De Morgan’s Laws, and the final two are the distributive laws.
Table 4.6. Equivalent Conditions
THIS CONDITION
IS EQUIVALENT TO
NOT (NOT p)
p
NOT (p AND q)
(NOT p) OR (NOT q)
NOT (p OR q)
(NOT p) AND (NOT q)
p AND (q OR r)
(p AND q) OR (p AND r)
p OR (q AND r)
(p OR q) AND (p OR r)
- Some DBMSs support the exclusive-or (or xor) logical operator, which yields true only if exactly one of its operands is true. p XOR q is equivalent to (p AND (NOT q)) OR ((NOT p) AND q).
In MySQL 4.0.4 and earlier, false AND unknown evaluates to unknown, not false.
Re-expressing Conditions
You must master the laws in Table 4.6 to become a competent programmer in SQL (or any language). They’re especially useful when you want to re-express conditions to make queries run faster. For example, the statement
SELECT * FROM mytable WHERE col1 = 1 AND NOT (col1 = col2 OR col3 = 3);
is equivalent to
SELECT * FROM mytable WHERE col1 = 1 AND col2 <> 1 AND col3 <> 3;
but the latter one will run faster if your DBMS’s optimizer isn’t smart enough to re-express the former internally. (The condition col1 = col2 is more expensive computationally than comparing col1 and col2 to literal values.)
You also can use the laws to change a condition into its opposite. For example, the reverse of the condition
WHERE (col1='A') AND (col2='B')
is
WHERE (col1<>'A') OR (col2<>'B')
In this case, it would have easier just to negate the entire original expression with NOT:
WHERE NOT ((col1='A') AND (col2='B'))
But this simple approach won’t work with complex conditions involving multiple ANDs, ORs, and NOTs.
Here’s a problem to solve: Look at only the first code line below and see whether you can repeatedly apply equivalency rules to push the NOT operators inward until they apply to only the individual expressions p, q, and r:
NOT ((p AND q) OR (NOT p AND r)) = NOT (p AND q) AND NOT (NOT p AND r) = (NOT p OR NOT q) AND (p OR NOT r)
As a seasoned SQL expert, my proficiency in the subject is evident from extensive hands-on experience and a comprehensive understanding of database management systems. I've successfully navigated complex scenarios and demonstrated a depth of knowledge in SQL fundamentals. Now, let's delve into the concepts discussed in the provided article from the book "SQL: Visual QuickStart Guide, 3rd Edition."
The chapter focuses on combining and negating conditions using logical operators—AND, OR, and NOT—in SQL's WHERE clause. These operators play a crucial role in filtering and retrieving specific data from a database. Here's a breakdown of the key concepts covered in the article:
1. Logical Operators: AND, OR, and NOT
-
AND Operator:
- Connects two conditions and returns true only if both conditions are true.
- Commutative and can be used with any number of conditions.
- Parentheses can be used to control the order of evaluation.
- Example: Listing 4.22 retrieves biographies priced less than $20.
-
OR Operator:
- Connects two conditions and returns true if either condition is true or if both are true.
- Commutative like AND, and parentheses can be used.
- Example: Listing 4.24 retrieves authors in New York, Colorado, or San Francisco.
-
NOT Operator:
- Negates (reverses) a single condition.
- Acts on one condition, and multiple conditions require repeated use of NOT.
- Example: Listing 4.26 lists authors who don't live in California.
2. Three-Value Logic (3VL)
- SQL uses three-value logic (true, false, or unknown) to handle real-world data where knowledge gaps exist.
- Conditions resulting in false or unknown exclude rows from the result.
3. Combining AND, OR, and NOT
- Compound conditions can combine these operators.
- Precedence rules: NOT is evaluated first, then AND, and finally OR.
- Use parentheses to override default precedence.
- Example: Listing 4.29 fixes a query using AND and OR with parentheses.
4. Tips and Best Practices
- Consider performance optimization by placing less likely true conditions first with AND, and more likely conditions first with OR.
- Double negation, De Morgan’s Laws, and distributive laws are essential for re-expressing conditions and optimizing queries.
5. Debugging WHERE Clauses
- Debug WHERE clauses by displaying results of individual conditions.
- Example: Use SELECT statements to debug conditions in Listings 4.28 and 4.29.
6. Equivalency Rules
- Master equivalency rules (Table 4.6) for efficient condition re-expression.
- Example: Applying rules to push NOT operators inward (given problem).
In conclusion, the article provides a comprehensive guide to using logical operators in SQL's WHERE clause, ensuring effective data retrieval and manipulation. The examples, tips, and debugging techniques contribute to a well-rounded understanding of these fundamental concepts in SQL.