SPSS Lesson 7 – Sorting and Filtering Data | Dataplexa

Sorting and Filtering Data

Once data has been entered and cleaned, the next important step is organizing it in a meaningful way. Sorting and filtering allow you to explore datasets, identify patterns, and focus on specific subsets of data.

In SPSS, sorting changes the order of cases, while filtering temporarily selects only certain cases for analysis without deleting any data.


Why Sorting and Filtering Are Important

Large datasets often contain hundreds or thousands of records. Without sorting and filtering, finding relevant information can be difficult and time-consuming.

These tools help you:

  • Identify highest or lowest values
  • Group similar cases together
  • Analyze specific subgroups
  • Perform focused statistical analysis

Professionals rely on sorting and filtering before almost every analysis.


Example Dataset

Consider the following employee dataset:

Employee_ID Department Age Monthly_Salary
301 Sales 29 42000
302 HR 35 38000
303 IT 26 55000
304 Sales 41 60000

This dataset can be explored more effectively using sorting and filtering.


Sorting Data in SPSS

Sorting arranges cases in ascending or descending order based on one or more variables.

For example, sorting by salary helps identify the highest and lowest paid employees.

Sorting can be done:

  • In ascending order (lowest to highest)
  • In descending order (highest to lowest)

Sorting changes only the display order, not the actual values.


Filtering Data in SPSS

Filtering selects only specific cases that meet a condition. Other cases remain in the dataset but are excluded from analysis.

For example, you may want to analyze:

  • Only employees from the IT department
  • Employees earning more than 50,000

Filtering is reversible and does not delete data, making it safe for exploratory analysis.


Using SPSS Syntax for Sorting

Sorting can be performed using syntax for repeatable and precise operations.


SORT CASES BY Monthly_Salary (D).
EXECUTE.

This command sorts employees from highest to lowest salary.


Using SPSS Syntax for Filtering

Filtering is often used to focus analysis on a specific group.


FILTER BY (Monthly_Salary > 50000).
EXECUTE.

This filter includes only employees earning more than 50,000 in analysis.

Filtered cases appear with a diagonal line in Data View, indicating they are excluded.


Real-World Use Case

In a company salary review, HR may first filter employees by department, then sort salaries to identify top earners and potential outliers.

Sorting and filtering together enable faster and more accurate decisions.


Quiz 1

What does sorting change in SPSS?

The order of cases, not the data values.


Quiz 2

What happens to filtered cases?

They remain in the dataset but are excluded from analysis.


Quiz 3

Which operation is reversible?

Filtering.


Quiz 4

Why is syntax useful for sorting?

It allows consistent and repeatable operations.


Quiz 5

What visual indicator shows filtered cases?

A diagonal line across the case number.


Mini Practice

Using an employee dataset:

  • Sort employees by Age in ascending order
  • Filter employees with Salary greater than 40,000

Observe how the Data View changes and how filtered cases are displayed.

Use Data → Sort Cases for sorting, and Data → Select Cases or FILTER syntax for filtering.


What’s Next

In the next lesson, you will learn how to recode values, which allows you to transform existing data into new categories for analysis.