How String Catalog Localization Works

String Catalog runs a parser-aware app localization automation pipeline for .xcstrings, Android strings.xml, i18next JSON, and TestFlight WhatToTest.<locale>.txt files. It scans your project (or accepts uploaded files), extracts and tracks string state, queues missing or needs-review translations, and writes normalized output back to disk. For repository projects, those changes are delivered through configurable repository automation instead of a one-size-fits-all sync path.

Beyond app strings, teams also use String Catalog for release notes, App Store and Google Play metadata, and human review workflows.

The pipeline is built to be:

Parser-aware (XCStrings, Android XML, i18next JSON, TestFlight)
Rename-aware
Review-state aware
Locale normalization aware
Safe to write back into source control

This page explains the full localization workflow end-to-end.

High-Level Pipeline

For repository-based projects, the pipeline typically runs as follows:

Clone the configured repository and checkout the target branch.
Locate supported localization files using consistent parser detection.
Validate file safety and encoding before parsing.
Parse base-language and localized entries.
Store extracted keys and track translation state over time.
Detect changes, renames, and stale keys (if enabled).
Queue missing or needs-review translations.
Run quality checks (if enabled).
Write normalized files back to disk.
Deliver the result through the matching automation path, usually a localization pull request or a pull-request review check.

For custom projects, the flow is similar but excludes repository cloning and repository automation behavior. Custom projects can receive files from dashboard uploads or the Custom Pipeline API.

Parser Detection & File Discovery

String Catalog selects a parser based on file type and structure:

.xcstrings
strings.xml
i18next .json files in supported locale path layouts
TestFlight WhatToTest.<locale>.txt files

Repository projects discover files by scanning the checked-out working tree. Custom projects operate on the uploaded file payload. Dashboard uploads currently accept .xcstrings; the Custom Pipeline API accepts .xcstrings, Android XML, and i18next JSON. In all supported flows, parser selection is consistent so the same inputs produce the same outputs.

Each parser is responsible for:

Determining base-language records
Identifying localized variants
Preserving format-specific structure and markup
Handling file-structure-specific edge cases

Only supported and validated files enter the pipeline.

Validation & Safety Model

Before parsing, each file is validated to ensure safe and predictable processing.

Validation includes:

Encoding and structure checks
Unsupported payload detection
Format-specific safety checks

Files that fail validation are rejected before parsing begins.

State Tracking Model

String Catalog maintains structured state for each string entry:

Base-language value
Localized values
Translation status
Needs-review flag
Stale marker (if applicable)

This state model allows the system to reason about:

When translations are missing
When base content changes
When keys are renamed
When localized values require review

Rename-Aware Retranslation

When enabled, rename-aware retranslation detects key identifier changes where the underlying value is carried forward.

This is especially important when localization files are modified outside of Xcode or other IDE tooling, where rename metadata may not be preserved.

Behavior:

If a key is renamed but its value is carried forward, translations can often be preserved.
If the value meaningfully changes, translations are requeued.
Detection is designed to stay reliable even when editor-generated rename metadata is incomplete or missing.

This helps prevent translation loss during refactors while maintaining integrity when semantics change.

Needs-Review Propagation

When a base-language value changes, non-base locales may be marked as NeedsReview.

This ensures:

Outdated translations are surfaced.
Review state propagates consistently from base to non-base locales.
Changes do not silently overwrite existing translations.

Internal safeguards help reduce redundant review triggers and duplicate processing.

Locale Normalization & Preferred Codes

Locale variants are normalized internally to ensure consistent tracking.

For example:

en, en-US, and en_US are canonicalized for state tracking.
Preferred codes are preserved when writing back to disk (for example en-US).
Locale matching remains predictable across parsers.

Normalization prevents duplicate locale entries while maintaining compatibility with platform expectations.

Stale Key Handling

When enabled, stale key removal can delete entries marked as stale.

Note

Stale-key handling is more conservative for repository projects than custom projects. Repository sync generally preserves localized rows until source keys are actually removed, while custom project sync can prune rows that disappear from the uploaded payload.

Stale behavior differs by project type:

Repository projects generally preserve localized rows unless source keys are removed from the base language.
Custom projects can prune localized rows that disappear from incoming file payloads.

Stale removal is optional and only runs when explicitly enabled.

Write-Back Normalization

After translation and review processing, files are rewritten in normalized form.

Normalization ensures:

Correct encoding
Proper escaping
Preservation of inline XML markup (including elements such as xliff:g)
Platform-compliant structure for XCStrings and Android XML

String Catalog avoids double encoding and unintended structural mutations.

Repository Delivery Flow

For repository-based projects:

Push automation can open or update a localization pull request for matching branches.
Pull request automation can inspect incoming pull requests without modifying them by default.
Teams can optionally allow pull request automation to commit translation changes directly to the PR branch.
Manual sync can open a localization pull request for a one-off branch without saving a new automation rule.
Repeat syncs aim to avoid unnecessary churn when output is already normalized.

Important

String Catalog documents the delivery behavior in user-facing terms such as push automation, pull request automation, and manual sync. Internal execution details may evolve, but the safe review model stays the same: only supported localization files are processed, branch matching is explicit, and pull requests remain the default review surface for repository changes.

Background Processing

Sync and translation work runs asynchronously.

This helps:

Keep the UI responsive during larger runs.
Let longer localization jobs finish in the background.
Keep processing predictable as projects grow.

Summary

String Catalog provides a parser-aware localization automation pipeline that:

Understands platform-specific file formats.
Tracks translation state and review status.
Detects renames and base-language changes.
Normalizes locale handling.
Safely writes back into source-controlled projects.

The result is automated localization that preserves structure, reduces churn, and integrates cleanly into modern development workflows.