tidyr separate_wider_position() in R: Split by Character Position

The separate_wider_position() function in tidyr 1.3 splits a string column into multiple columns based on FIXED CHARACTER POSITIONS. It is the right tool for fixed-width formats like dates without delimiters or coded IDs.

By Selva Prabhakaran · Published May 12, 2026 · Last updated May 12, 2026

⚡ Quick Answer

df |> separate_wider_position(col, widths = c(year=4, month=2, day=2))
df |> separate_wider_position(col, widths = c(prefix=3, code=5, suffix=2))
df |> separate_wider_position(col, widths = c(year=4, 2, month=2)) # unnamed width = skip
df |> separate_wider_delim(col, delim = "-")  # different: delimiter-based
df |> separate_wider_regex(col, patterns = c(...)) # regex-based

Need explanation? Read on for examples and pitfalls.

What separate_wider_position() does in one sentence

separate_wider_position(data, cols, widths) splits each value of cols at the cumulative positions defined by the named integer vector widths. Each name becomes a new column with the corresponding number of characters.

Syntax

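separate_wider_position(data, cols, widths, ..., names_sep = NULL, names_repair = "check_unique", too_few = "error", too_many = "error", cols_remove = TRUE). widths is a (partially) named numeric vector: the names become the new column names, and the values are the segment widths in characters.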
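As a minimal sketch of two of the optional arguments, the snippet below uses a made-up date_str column. It assumes you want to keep the source column (cols_remove = FALSE) or prefix the new column names with the input column name (names_sep).

```r
library(tidyr)

# Hypothetical example data
df <- tibble(date_str = c("20240115", "20240320"))

# Keep the source column alongside the new columns
kept <- df |>
  separate_wider_position(
    date_str,
    widths = c(year = 4, month = 2, day = 2),
    cols_remove = FALSE
  )

# Prefix each new column name with the input column name
prefixed <- df |>
  separate_wider_position(
    date_str,
    widths = c(year = 4, month = 2, day = 2),
    names_sep = "_"
  )

names(prefixed)
#> [1] "date_str_year"  "date_str_month" "date_str_day"
```

names_sep is mainly useful when cols selects several columns, since it keeps the generated names from colliding.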

R: Parse YYYYMMDD

library(tidyr)
library(dplyr)
df <- tibble(date_str = c("20240115", "20240320"))
df |>
  separate_wider_position(date_str, widths = c(year = 4, month = 2, day = 2))
#> year month day
#> 2024 01 15
#> 2024 03 20

Tip

Use named widths to label each segment. To skip a segment, leave its width unnamed: widths = c(year = 4, 2, month = 2) skips 2 characters between year and month.

Five common patterns

1. Date string

R: YYYYMMDD format

df |> separate_wider_position(d, widths = c(year=4, month=2, day=2))

2. Code with prefix and suffix

R: Parse 'PRE12345AB'

df <- tibble(code = c("PRE12345AB"))
df |>
  separate_wider_position(code, widths = c(prefix = 3, num = 5, suffix = 2))
#> prefix num suffix
#> PRE 12345 AB

3. Skip middle characters

R: Skip a separator character

df <- tibble(x = c("AB-12-CD"))
# Unnamed widths (the two 1s) consume the dashes without creating columns
df |>
  separate_wider_position(x, widths = c(a = 2, 1, num = 2, 1, b = 2))
#> a num b
#> AB 12 CD

4. Handle short strings with too_few

R: Short rows tolerated

df <- tibble(x = c("ABC123", "XY"))
# align_start pads short rows with NA at the end instead of raising an error
df |>
  separate_wider_position(x, widths = c(a = 3, b = 3), too_few = "align_start")
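After tolerating short rows, it usually helps to find out which rows were affected. The following is a small sketch under the assumption that a missing trailing piece is the signal you care about; the data is the same two-row example as above.

```r
library(tidyr)
library(dplyr)

df <- tibble(x = c("ABC123", "XY"))

parsed <- df |>
  separate_wider_position(x, widths = c(a = 3, b = 3), too_few = "align_start")

# Rows where the last piece is missing came from strings that ended early
short_rows <- parsed |>
  filter(is.na(b))

nrow(short_rows)
#> [1] 1
```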

5. Combine with other tidy operations

R: Parse then transform

df |>
  separate_wider_position(date_str, widths = c(year = 4, month = 2, day = 2)) |>
  mutate(across(everything(), as.integer))
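Note that across(everything(), ...) converts every column in the result, which is only safe when the data frame holds nothing but the split parts. A hedged variant, using a hypothetical store column to stand in for other data, converts just the new columns:

```r
library(tidyr)
library(dplyr)

# Hypothetical data with a non-numeric column next to the date string
df <- tibble(
  store = c("north", "south"),
  date_str = c("20240115", "20240320")
)

parsed <- df |>
  separate_wider_position(date_str, widths = c(year = 4, month = 2, day = 2)) |>
  # Convert only the new columns; `store` stays character
  mutate(across(c(year, month, day), as.integer))
```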

Key Insight

Each width represents a CHARACTER COUNT, not a position. widths = c(a=3, b=2) means "first 3 chars to a, next 2 chars to b". Cumulative positions: a is chars 1-3, b is chars 4-5.
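The width-to-position arithmetic can be checked in base R. This is only an illustration of the cumulative-sum idea on a made-up string, not a description of how tidyr implements it internally.

```r
# Widths are character counts; start and end positions follow from a running sum
widths <- c(a = 3, b = 2)

ends <- cumsum(widths)        # a ends at 3, b ends at 5
starts <- ends - widths + 1   # a starts at 1, b starts at 4

# Extract the same segments from a sample string (hypothetical value)
x <- "ABCDE"
pieces <- substring(x, starts, ends)
names(pieces) <- names(widths)
pieces  # a = "ABC", b = "DE"
```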

separate_wider_position() vs separate_wider_delim() vs str_sub

Function	Splits by	Best for
`separate_wider_position()`	Character positions	Fixed-width formats
`separate_wider_delim()`	Delimiter	Variable-width parts
`separate_wider_regex()`	Regex groups	Pattern-based
`stringr::str_sub()`	Position substring	One column at a time

When to use which:

separate_wider_position for FIXED-WIDTH (dates, codes).
separate_wider_delim for DELIMITED.
separate_wider_regex for COMPLEX patterns.
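To make the comparison with stringr::str_sub() concrete, here is a small side-by-side sketch on the YYYYMMDD example. The str_sub() route needs explicit start and end positions for every part, which is why it suits one-off extractions better than full splits.

```r
library(tidyr)
library(dplyr)
library(stringr)

df <- tibble(date_str = c("20240115", "20240320"))

# One call splits the column into all three parts
wide <- df |>
  separate_wider_position(date_str, widths = c(year = 4, month = 2, day = 2))

# str_sub() extracts one part per call, with explicit start/end positions
manual <- df |>
  mutate(
    year  = str_sub(date_str, 1, 4),
    month = str_sub(date_str, 5, 6),
    day   = str_sub(date_str, 7, 8)
  ) |>
  select(-date_str)

identical(wide$year, manual$year)
#> [1] TRUE
```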

A practical workflow

Use separate_wider_position() for fixed-width data formats, which are common in exports from legacy systems.

df |>
  separate_wider_position(
    transaction_id,
    widths = c(year = 4, month = 2, day = 2, branch = 3, seq = 6)
  )

Parse a structured ID into its semantic components in one step.
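A runnable version of this workflow might look like the sketch below. The transaction IDs and their layout (8-digit date, 3-character branch, 6-digit sequence) are invented for illustration.

```r
library(tidyr)
library(dplyr)

# Hypothetical 17-character IDs: YYYYMMDD + 3-char branch + 6-digit sequence
df <- tibble(transaction_id = c("20240115NYC000123", "20240320LAX004567"))

parsed <- df |>
  separate_wider_position(
    transaction_id,
    widths = c(year = 4, month = 2, day = 2, branch = 3, seq = 6)
  ) |>
  mutate(
    # Rebuild a proper Date from the three date parts
    date = as.Date(paste(year, month, day, sep = "-")),
    # Convert the zero-padded sequence to an integer
    seq = as.integer(seq)
  )
```

All pieces come back as character, so type conversion is a separate, explicit step.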

Common pitfalls

Pitfall 1: forgetting to name widths. Unnamed widths are dropped (treated as skip). Always name segments you want to keep.

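Pitfall 2: mismatched total width. If a string is longer than the sum of widths, the default too_many = "error" stops with an error; the extra characters are dropped silently only if you pass too_many = "drop". Strings shorter than the sum of widths are governed by too_few.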
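One way to make a width mismatch visible up front is to compare each string's length with sum(widths) before splitting. The sketch below uses made-up data and then passes too_many = "drop" explicitly to discard the surplus characters.

```r
library(tidyr)

widths <- c(a = 3, b = 2)

# Hypothetical data: the second value is longer than sum(widths) = 5
df <- tibble(x = c("ABCDE", "ABCDEFGH"))

# Flag values whose length differs from the total width
bad <- nchar(df$x) != sum(widths)
df$x[bad]
#> [1] "ABCDEFGH"

# Explicitly keep only the first sum(widths) characters of over-long values
trimmed <- df |>
  separate_wider_position(x, widths = widths, too_many = "drop")
```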

Warning

separate_wider_position() operates on CHARACTER counts, not BYTE counts. For multi-byte UTF-8 strings, the count is by codepoint, not byte.
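The difference between characters and bytes is easy to see with an accented letter. This sketch uses a made-up value written with a \u escape so it does not depend on the file's encoding.

```r
library(tidyr)

# "caf\u00e9" is 4 characters; the accented e takes 2 bytes in UTF-8
df <- tibble(x = "caf\u00e9X9")

nchar(df$x, type = "chars")  # 6
nchar(df$x, type = "bytes")  # 7

# Widths count characters, so 4 covers the whole word including the accent
out <- df |>
  separate_wider_position(x, widths = c(word = 4, code = 2))
```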

Try it yourself

Try it: Split a 6-character ID like "A12-CD" into 3 named segments, skipping the dash. Save to ex_parsed.

R: Your turn: parse fixed-width ID

df <- tibble(id = c("A12-CD"))
ex_parsed <- df |>
  # your code here
names(ex_parsed)
#> Expected: c("letter", "num", "code")

R: Solution

ex_parsed <- df |>
  separate_wider_position(id, widths = c(letter = 1, num = 2, 1, code = 2))
ex_parsed
#> letter num code
#> A 12 CD

Explanation: Skip the dash between num and code with an unnamed width of 1.

Related tidyr functions

After mastering separate_wider_position(), look at:

separate_wider_delim(): delimiter-based
separate_wider_regex(): regex-based
separate_longer_delim(): split into rows
unite(): combine columns
stringr::str_sub(): position-based substring

FAQ

What does separate_wider_position do in tidyr?

Splits a string column into multiple columns based on character widths. Each width specifies how many characters go into each new column.

How do I skip characters with separate_wider_position?

Leave the width unnamed: widths = c(a = 3, 1, b = 2) skips one character between a and b.

What is the difference between separate_wider_position and separate_wider_delim?

position uses fixed character counts; delim uses a delimiter. Use position for fixed-width (YYYYMMDD); delim for variable parts (a-b-c).

Can I parse multi-byte UTF-8 with separate_wider_position?

Yes. Counts are by codepoint, not byte.

What happens if the string is shorter than the widths?

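By default it errors. Pass too_few = "align_start" to pad the missing trailing pieces with NA, or too_few = "debug" to add diagnostic columns. Unlike separate_wider_delim(), separate_wider_position() has no "align_end" option.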
