Oracle Database Lesson 38 – Oracle Data Pump (Export & Import) | Dataplexa

Data Pump (Import & Export)

Oracle Data Pump is the standard tool for moving data and schema objects into and out of an Oracle database. It replaces the older exp and imp utilities and is dramatically faster — Data Pump runs inside the database server itself, reads and writes directly to server-side files, and can parallelise work across multiple streams. It is used for migrations between databases, schema cloning, refreshing test environments from production, and archiving historical data.

How Data Pump Works

Data Pump exports produce dump files — Oracle's proprietary binary format that captures both data and metadata (table definitions, indexes, constraints, grants, sequences). These files are written to and read from a directory object — a named Oracle object that points to a physical path on the database server's filesystem. You cannot specify a raw file path in a Data Pump command; you must always reference a directory object. The two executables are expdp (export) and impdp (import) — both are run from the operating system command line, not from SQL.

  • Data Pump runs as a job inside the database — you can monitor, pause, and restart it without losing progress
  • A directory object must exist and the user must have READ and WRITE privilege on it before exporting or importing
  • Dump files can only be read by Data Pump — they are not plain text or CSV files
  • Data Pump can export and import at four levels: full database, schema, tablespace, or individual table
  • The PARALLEL parameter runs multiple worker processes simultaneously — speeds up large exports and imports significantly
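The first bullet deserves a concrete illustration: because a Data Pump job lives in the database, you can detach from the client and re-attach later using the ATTACH parameter, then control the job interactively. A sketch, assuming the default generated job name SYS_EXPORT_SCHEMA_01 (check DBA_DATAPUMP_JOBS for the actual name):

```sql
-- Attach to a running job from a second terminal
-- expdp system/password ATTACH=SYS_EXPORT_SCHEMA_01
-- At the Export> prompt, interactive commands include:
--   STATUS      -- show progress of the worker processes
--   STOP_JOB    -- pause the job; its state is kept so it can be restarted
--   START_JOB   -- resume a stopped job from where it left off
--   KILL_JOB    -- abort the job and discard its state
```

Closing the terminal that launched an export does not kill the job — it keeps running in the server, which is exactly why ATTACH exists.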

Creating a Directory Object

Before running any Data Pump operation, a DBA must create a directory object pointing to an existing filesystem path on the database server and grant the exporting or importing user the appropriate privileges on it.

-- Create a directory object pointing to an existing server-side path
-- The path must already exist on the filesystem -- Oracle does not create it
CREATE OR REPLACE DIRECTORY dp_dir AS '/u01/app/oracle/datapump';
-- Grant the Data Pump user READ and WRITE on the directory
GRANT READ, WRITE ON DIRECTORY dp_dir TO dataplexa_store;
-- Verify the directory object exists
SELECT directory_name, directory_path
FROM   dba_directories
WHERE  directory_name = 'DP_DIR';
-- CREATE DIRECTORY
Directory created.

-- GRANT
Grant succeeded.

-- DBA_DIRECTORIES
DIRECTORY_NAME | DIRECTORY_PATH
---------------|------------------------------
DP_DIR | /u01/app/oracle/datapump
(1 row selected)
  • The directory object name is stored in uppercase in DBA_DIRECTORIES — queries must match exactly
  • The filesystem path must be accessible by the Oracle OS user (oracle) — if Oracle cannot write to it, the export fails immediately
  • A single directory object can be reused for multiple export and import operations — there is no need to create a new one each time
  • Oracle provides a default directory called DATA_PUMP_DIR pointing to a standard path — use it for quick tests but create a dedicated directory for production work
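The last bullet mentions the default DATA_PUMP_DIR. Its server-side path varies by installation, so it is worth looking up before relying on it — a quick check, assuming DBA access:

```sql
-- Find where the default DATA_PUMP_DIR points on this server
SELECT directory_path
FROM   dba_directories
WHERE  directory_name = 'DATA_PUMP_DIR';
```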

Exporting with expdp

All expdp commands are run from the OS command line as a user with the required database privileges. The most common export modes are schema-level (exports everything owned by a user) and table-level (exports specific tables). The LOGFILE parameter is always worth specifying — it captures the full output of the export job including any errors, written to the same directory object.

-- Export an entire schema -- run from the OS command line as oracle user
-- expdp system/password SCHEMAS=dataplexa_store DIRECTORY=dp_dir DUMPFILE=store_schema.dmp LOGFILE=store_export.log
-- Export specific tables only
-- expdp system/password TABLES=dataplexa_store.orders,dataplexa_store.customers DIRECTORY=dp_dir DUMPFILE=orders_customers.dmp LOGFILE=tables_export.log
-- Export with parallel workers and a compressed dump file (Enterprise Edition)
-- expdp system/password SCHEMAS=dataplexa_store DIRECTORY=dp_dir DUMPFILE=store_%U.dmp PARALLEL=4 COMPRESSION=ALL LOGFILE=store_parallel.log
-- Export only rows matching a condition (QUERY parameter)
-- expdp system/password TABLES=dataplexa_store.orders DIRECTORY=dp_dir DUMPFILE=orders_2024.dmp QUERY=dataplexa_store.orders:'"WHERE order_date >= DATE '"'"'2024-01-01'"'"'"' LOGFILE=orders_2024.log
-- Monitor a running Data Pump job from SQL
SELECT job_name, state, degree, attached_sessions
FROM   dba_datapump_jobs
WHERE  state != 'NOT RUNNING';
-- Schema export output (abbreviated)
Export: Release 19.0.0.0.0
Connected to: Oracle Database 19c Enterprise Edition
Starting "SYSTEM"."SYS_EXPORT_SCHEMA_01"
Estimate in progress using BLOCKS method...
Processing object type SCHEMA_EXPORT/TABLE/TABLE_DATA
Processing object type SCHEMA_EXPORT/TABLE/INDEX/INDEX
Processing object type SCHEMA_EXPORT/TABLE/CONSTRAINT/CONSTRAINT
. . exported "DATAPLEXA_STORE"."ORDERS" 1.2 MB 148 rows
. . exported "DATAPLEXA_STORE"."CUSTOMERS" 320 KB 42 rows
. . exported "DATAPLEXA_STORE"."PRODUCTS" 180 KB 18 rows
Job "SYSTEM"."SYS_EXPORT_SCHEMA_01" successfully completed
  • The %U in the dumpfile name is replaced with a two-digit number — store_01.dmp, store_02.dmp and so on — needed when using PARALLEL so that each worker process can write to its own file
  • COMPRESSION=ALL compresses both data and metadata — typically reduces dump file size by 60–80% at a modest CPU cost
  • DBA_DATAPUMP_JOBS shows all running and recently completed Data Pump jobs — useful for monitoring long-running exports
  • The export log file is the first place to look if an export completes but objects are missing — warnings about skipped objects are written there

Importing with impdp

Importing reads from a dump file and recreates the objects and data in the target database. The most important import parameters are REMAP_SCHEMA — which redirects objects from one schema to another, essential when importing into a different user — and REMAP_TABLESPACE — which redirects objects to a different tablespace when the source and target have different tablespace layouts. Use TABLE_EXISTS_ACTION to control what happens when a table already exists in the target.

-- Full schema import into the same schema name
-- impdp system/password SCHEMAS=dataplexa_store DIRECTORY=dp_dir DUMPFILE=store_schema.dmp LOGFILE=store_import.log
-- Import into a different schema -- REMAP_SCHEMA redirects all objects
-- impdp system/password DIRECTORY=dp_dir DUMPFILE=store_schema.dmp REMAP_SCHEMA=dataplexa_store:store_uat LOGFILE=uat_import.log
-- Import and redirect objects to a different tablespace
-- impdp system/password DIRECTORY=dp_dir DUMPFILE=store_schema.dmp REMAP_SCHEMA=dataplexa_store:store_uat REMAP_TABLESPACE=users:uat_data LOGFILE=uat_import.log
-- Import specific tables only -- useful for restoring a single table from a full schema dump
-- impdp system/password DIRECTORY=dp_dir DUMPFILE=store_schema.dmp TABLES=dataplexa_store.orders LOGFILE=orders_import.log
-- TABLE_EXISTS_ACTION options:
-- SKIP (default) -- skip the table if it exists
-- APPEND         -- add rows without truncating existing data
-- TRUNCATE       -- truncate then load
-- REPLACE        -- drop, recreate, then load
-- impdp system/password DIRECTORY=dp_dir DUMPFILE=orders_customers.dmp TABLE_EXISTS_ACTION=APPEND LOGFILE=append_import.log
-- Schema import output (abbreviated)
Import: Release 19.0.0.0.0
Connected to: Oracle Database 19c Enterprise Edition
Starting "SYSTEM"."SYS_IMPORT_SCHEMA_01"
Processing object type SCHEMA_EXPORT/TABLE/TABLE
Processing object type SCHEMA_EXPORT/TABLE/TABLE_DATA
. . imported "STORE_UAT"."ORDERS" 148 rows
. . imported "STORE_UAT"."CUSTOMERS" 42 rows
. . imported "STORE_UAT"."PRODUCTS" 18 rows
Processing object type SCHEMA_EXPORT/TABLE/INDEX/INDEX
Processing object type SCHEMA_EXPORT/TABLE/CONSTRAINT/CONSTRAINT
Job "SYSTEM"."SYS_IMPORT_SCHEMA_01" successfully completed
  • REMAP_SCHEMA=source:target redirects every object from the source schema to the target — if the dump file contains the source schema's definition (a schema-mode export taken by a privileged user), Data Pump creates the target schema automatically; otherwise it must be created first
  • REMAP_TABLESPACE=source:target is essential when the target database does not have tablespaces with the same names as the source
  • TABLE_EXISTS_ACTION=APPEND is the safest option for adding data to an existing table — it does not modify existing rows
  • You can import specific tables from a full database or full schema dump without needing to re-export — Data Pump reads only the relevant portions of the dump file
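After a remapped import it is worth confirming that the objects actually landed in the target schema before handing the environment over. A minimal sanity check, assuming the STORE_UAT schema from the example above:

```sql
-- Confirm the remapped objects exist in the target schema
SELECT object_type, COUNT(*) AS object_count
FROM   dba_objects
WHERE  owner = 'STORE_UAT'
GROUP  BY object_type;

-- Spot-check a row count against the figures in the import log
SELECT COUNT(*) AS order_rows FROM store_uat.orders;
```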

EXCLUDE and INCLUDE — Filtering What Is Exported

Large schemas often contain objects you do not need in a particular dump — statistics, audit trails, or specific tables that contain sensitive data. The EXCLUDE and INCLUDE parameters filter the export at the object type level, keeping dump files smaller and imports faster. Only one of the two can be used in a single operation.

-- Export the schema but exclude statistics (they will be regathered on import)
-- and exclude the audit log table which is large and not needed in UAT
-- expdp system/password SCHEMAS=dataplexa_store DIRECTORY=dp_dir DUMPFILE=store_no_stats.dmp EXCLUDE=STATISTICS EXCLUDE=TABLE:'"IN ('"'"'PRODUCT_PRICE_AUDIT'"'"')"' LOGFILE=exclude_export.log
-- Export only tables and their data -- exclude all other object types
-- expdp system/password SCHEMAS=dataplexa_store DIRECTORY=dp_dir DUMPFILE=tables_only.dmp INCLUDE=TABLE LOGFILE=include_export.log
-- Export only specific named tables using INCLUDE with a filter
-- expdp system/password SCHEMAS=dataplexa_store DIRECTORY=dp_dir DUMPFILE=core_tables.dmp INCLUDE=TABLE:'"IN ('"'"'ORDERS'"'"','"'"'CUSTOMERS'"'"','"'"'PRODUCTS'"'"')"' LOGFILE=core_export.log
-- Check the content of a dump file without importing it (SQLFILE writes DDL to a file)
-- impdp system/password DIRECTORY=dp_dir DUMPFILE=store_schema.dmp SQLFILE=dp_dir:schema_ddl.sql
-- Export with EXCLUDE=STATISTICS
Processing object type SCHEMA_EXPORT/TABLE/TABLE_DATA
Processing object type SCHEMA_EXPORT/TABLE/INDEX/INDEX
Processing object type SCHEMA_EXPORT/TABLE/CONSTRAINT/CONSTRAINT
-- STATISTICS object type skipped as per EXCLUDE parameter
Job completed successfully

-- SQLFILE parameter output
-- Writes DDL to schema_ddl.sql without importing any data
-- Useful for reviewing what a dump file contains before committing to an import
  • EXCLUDE=STATISTICS is almost always a good idea — statistics from the source database are rarely valid on a target with different data volumes, and excluding them reduces dump file size
  • The SQLFILE parameter on impdp extracts all DDL from a dump file to a SQL script without executing anything — use it to preview or audit what an import would create
  • INCLUDE and EXCLUDE accept object type names as defined in Oracle's Data Pump object type hierarchy — TABLE, INDEX, CONSTRAINT, GRANT, SEQUENCE, VIEW, PROCEDURE, and others
  • Single and double quote nesting in EXCLUDE and INCLUDE filter expressions is notoriously tricky at the OS command line — consider using a parameter file instead
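As the last bullet suggests, a parameter file sidesteps the shell-quoting problem entirely: inside a parfile, only ordinary double quotes around the filter clause are needed. A sketch of the audit-table exclusion rewritten as a parfile (the filename exclude_audit.par is arbitrary):

```
SCHEMAS=dataplexa_store
DIRECTORY=dp_dir
DUMPFILE=store_no_stats.dmp
LOGFILE=exclude_export.log
EXCLUDE=STATISTICS
EXCLUDE=TABLE:"IN ('PRODUCT_PRICE_AUDIT')"
```

It is then invoked with expdp system/password PARFILE=exclude_audit.par, and the same technique works for QUERY expressions and for impdp.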

Lesson Summary

Concept             | What It Means
--------------------|-----------------------------------------------------------------
expdp               | OS command-line export tool — writes data and metadata to a dump file
impdp               | OS command-line import tool — reads a dump file and recreates objects and data
Directory object    | Named Oracle object pointing to a server filesystem path — required for all Data Pump operations
REMAP_SCHEMA        | Redirects all objects from the source schema to a different target schema on import
REMAP_TABLESPACE    | Redirects objects to a different tablespace on import — needed when source and target layouts differ
TABLE_EXISTS_ACTION | Controls import behaviour when a table already exists — SKIP, APPEND, TRUNCATE, or REPLACE
PARALLEL            | Runs multiple worker processes simultaneously — use %U in the dumpfile name so each worker can write its own file
EXCLUDE / INCLUDE   | Filter object types from an export or import — the two cannot be combined in one operation
SQLFILE             | impdp parameter that extracts DDL from a dump file to a SQL script without importing

Practice Questions

Practice 1. Why must you create a directory object before using Data Pump? Can you specify a raw filesystem path instead?



Practice 2. You are importing a production schema dump into a UAT database where the target schema is called store_uat. Which parameter do you use?



Practice 3. What does TABLE_EXISTS_ACTION=APPEND do, and how does it differ from TRUNCATE?



Practice 4. Why is EXCLUDE=STATISTICS almost always a good idea when exporting for a refresh of a UAT environment?



Practice 5. How do you preview the DDL that a dump file would create without actually importing anything?



Quiz

Quiz 1. You run an expdp command with PARALLEL=4 but use DUMPFILE=store.dmp (a single filename). What happens?






Quiz 2. A DBA wants to restore only the orders table from a full schema dump taken last week. Is this possible without re-exporting?






Quiz 3. What is the key difference between Data Pump and the older exp/imp utilities?






Quiz 4. You use INCLUDE=TABLE in an export. Which objects are excluded?






Quiz 5. The Oracle user running expdp has no privileges on the directory object. What happens?






Next up — Oracle Cloud (OCI) — What Oracle Cloud Infrastructure is, how it differs from on-premises Oracle, and the key database services available including Autonomous Database.