Type Safety Beyond Compile Time: An Introduction to Zod

Published on June 17, 2024

Tags: Zod TypeScript Validation Schema Type Safety Data Validation API Frontend Backend

TypeScript offers incredible benefits with its static type system, catching errors at compile time and improving developer productivity. However, TypeScript’s guarantees often end at your application’s boundaries. What happens when data comes in from an external API, a user form, or a configuration file? How can you be sure that runtime data actually matches the TypeScript types you expect?

Enter Zod, a TypeScript-first schema declaration and validation library that bridges this gap beautifully.

What is Zod?

Zod allows you to define schemas for your data structures, from simple primitives like strings and numbers to complex nested objects and arrays. These schemas serve two primary purposes:

Schema Declaration: They provide a clear, concise way to define the expected shape and types of your data.
Runtime Validation: They can parse and validate unknown data at runtime, ensuring it conforms to the defined schema.

Crucially, Zod is designed with TypeScript integration as a top priority. You don’t need separate validation logic and TypeScript types; Zod schemas are the single source of truth.

Why Use Zod?

True Runtime Safety: Catch errors caused by unexpected data structures before they wreak havoc in your application logic. No more undefined is not a function errors because an optional API field was missing!
Single Source of Truth: Define your data shape once using a Zod schema, and automatically infer the corresponding TypeScript type. No more keeping types and validation rules manually in sync.
Excellent Developer Experience: Zod’s API is fluent, intuitive, and highly composable. Error messages are detailed and pinpoint exactly where validation failed.
TypeScript First: Seamlessly integrates with TypeScript, inferring static types directly from your runtime schemas.
Feature Rich: Supports complex validation scenarios including optional fields, default values, arrays, unions, intersections, transformations, refinements (custom validation rules), and much more.

Core Concepts with Examples

Let’s look at how Zod works using the provided examples.

Defining Schemas

You define schemas using Zod’s built-in functions (z.string(), z.number(), z.object(), z.array(), etc.) and chainable modifiers (.optional(), .default(), .min(), .max(), etc.).

import { z } from 'zod';

// Define a schema for a single value object
const MyValueSchema = z.object({
  // 'name' must be a string and is required
  name: z.string(),

  // 'phone' must be a string, but it's optional (can be undefined or missing)
  phone: z.string().optional(),

  // 'keywords' must be an array of strings.
  // If the 'keywords' property is missing on the INPUT data during parsing,
  // it will default to an empty array in the OUTPUT.
  keywords: z.array(z.string()).default([]),
});

// Define a schema for an object containing an array of MyValue objects
const MyValueArraySchema = z.object({
  // 'results' must be an array containing objects that match MyValueSchema
  results: z.array(MyValueSchema),
});

Here:

z.object({}) defines the shape of an object.
z.string() requires a string value.
.optional() makes a field optional (it can be undefined in the input).
z.array(z.string()) defines an array where each element must be a string.
.default([]) provides a default value if the field is missing during parsing.

Parsing and Validation

Once you have a schema, you can use it to parse and validate unknown data. Zod offers two main methods:

.parse(data): Validates the data. If validation fails, it throws a ZodError with detailed information. If successful, it returns the validated (and potentially transformed, e.g., by .default()) data.
.safeParse(data): Validates the data but does not throw an error on failure. Instead, it returns an object with either { success: true, data: validatedData } or { success: false, error: ZodError }. This is useful for handling validation errors gracefully.

// Example data coming from an API (potentially unsafe)
const validApiData = {
  results: [
    { name: 'Alice', phone: '123-456-7890', keywords: ['dev', 'ts'] },
    { name: 'Bob' }, // 'phone' is optional, 'keywords' will get default []
  ],
};

const invalidApiData = {
  results: [
    { name: 'Charlie', phone: 12345 }, // phone is not a string
    { keywords: [true] } // name is missing, keyword element is not a string
  ],
};

// --- Using .parse() ---
try {
  const validatedData = MyValueArraySchema.parse(validApiData);
  console.log("Parse Success:", validatedData);
  // Output: Parse Success: { results: [ { name: 'Alice', phone: '123-456-7890', keywords: [ 'dev', 'ts' ] }, { name: 'Bob', keywords: [] } ] }
  // Note: Bob's object now has the default keywords array

  // This will throw an error
  // MyValueArraySchema.parse(invalidApiData);

} catch (error) {
    if (error instanceof z.ZodError) {
        console.error("Parse Error:", error.errors);
    }
}

// --- Using .safeParse() ---
const safeResultValid = MyValueArraySchema.safeParse(validApiData);
if (safeResultValid.success) {
  console.log("SafeParse Success:", safeResultValid.data);
   // Output: SafeParse Success: { results: [ { name: 'Alice', phone: '123-456-7890', keywords: [ 'dev', 'ts' ] }, { name: 'Bob', keywords: [] } ] }
}

const safeResultInvalid = MyValueArraySchema.safeParse(invalidApiData);
if (!safeResultInvalid.success) {
  console.error("SafeParse Error Details:", safeResultInvalid.error.flatten());
  /* Output similar to:
  SafeParse Error Details: {
    formErrors: [],
    fieldErrors: {
      results: [
        'Expected string, received number at path results[0].phone',
        'Required at path results[1].name',
        'Expected string, received boolean at path results[1].keywords[0]'
      ]
    }
  }
  */
}

Inferring TypeScript Types

This is where Zod truly shines. You can infer static TypeScript types directly from your schemas using z.infer.

// Infer the TypeScript type represented by the schema AFTER parsing/validation
// This is the type you'll typically use in your application code.
type MyValueArrayOutput = z.infer<typeof MyValueArraySchema>;

/*
MyValueArrayOutput will be equivalent to:
{
  results: {
    name: string;
    phone?: string | undefined; // optional modifier makes it optional
    keywords: string[];         // default guarantees it's always string[] in the output
  }[];
}
*/

// You can now use this type with full type safety:
function processResults(data: MyValueArrayOutput) {
  for (const item of data.results) {
    console.log(`Processing user: ${item.name}`);
    // item.keywords is guaranteed to be string[] here
    item.keywords.forEach(kw => console.log(` - Keyword: ${kw}`));
    if (item.phone) {
      console.log(` Phone: ${item.phone}`);
    }
  }
}

// Assuming validation passed earlier:
const validatedData = MyValueArraySchema.parse(validApiData);
processResults(validatedData); // Works perfectly!

z.infer gives you the type of the data after it has been successfully parsed and transformed by Zod (including applying defaults, etc.). This is usually the type you want for the rest of your application logic.

Input vs. Output Types (`z.input` vs `z.infer`)

Sometimes, the type of data you expect as input differs slightly from the type you get after validation and transformation. This is particularly relevant when using modifiers like .default(), .preprocess(), or .transform().

z.infer<typeof Schema>: Infers the output type (after validation/transformation).
z.input<typeof Schema>: Infers the input type (before validation/transformation).

Let’s look at our MyValueArraySchema:

// The type expected as INPUT before parsing
type MyValueArrayInput = z.input<typeof MyValueArraySchema>;

/*
MyValueArrayInput will be equivalent to:
{
  results: {
    name: string;
    phone?: string | undefined;
    keywords?: string[] | undefined; // Input might not have 'keywords', default applies during parse
  }[];
}
*/

// The type guaranteed as OUTPUT after parsing
type MyValueArrayOutput = z.infer<typeof MyValueArraySchema>;

/*
MyValueArrayOutput is (as seen before):
{
  results: {
    name: string;
    phone?: string | undefined;
    keywords: string[]; // Output ALWAYS has 'keywords' because of .default([])
  }[];
}
*/

Notice the difference in the keywords property. The Input type reflects that keywords might be missing initially, while the Output type guarantees it will be present (as an empty array if it was missing) because of the .default([]) modifier. You typically use the Input type if you need to type data before passing it to Zod’s .parse() or .safeParse() methods.

Conclusion

Zod provides an elegant and robust solution for runtime data validation in TypeScript applications. By defining schemas that serve as both validation rules and the source for static types (z.infer), Zod eliminates inconsistencies, enhances type safety across application boundaries, and improves the overall developer experience. Whether you’re dealing with API responses, user input, or configuration files, Zod is an invaluable tool for building more reliable and maintainable TypeScript applications. Stop guessing what shape your data is in – use Zod to be sure!