How to safely convert strings to integers in Redshift


Tristan Handy

on Apr 08, 2025

This is a very stupid problem. I am not writing this post because it's a fascinating topic---rather, I'm writing it in the hopes that you avoid the headaches that I've gone through scouring the internet for the best answer to this question:

Let's say I have a Redshift table users. This table gets loaded via some process I don't control. It contains a field, amount, that gets loaded as a varchar when it should really be an int. A very small number of records (< .01%) are not valid integers, so a simple ::int cast fails. How do I convert it?
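
To see why this is painful, here's roughly what the naive approach looks like (assuming the users table and amount column described above); the moment the cast hits a non-numeric value, the whole query errors out instead of skipping the bad row:

-- naive cast: errors on the first value that isn't a valid integer
select amount::int as amount
from users;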

This situation happens constantly for analysts. We don't control our source data: data is being loaded into Redshift by products like Stitch and Fivetran and by the data engineers we work with. We have to deal with the data that comes to us, not the data we wish we had.

I've spent several hours searching for an elegant solution. This seems like a problem that people would run into all the time, right? There must be an elegant way.

As it turns out, Snowflake and BigQuery both have a function for exactly this: try_cast in Snowflake, safe_cast in BigQuery. It does exactly what you would want: if it's able to convert from one format to another, it does so. If not, it returns a null. Sweet. Unfortunately, Redshift doesn't have anything like it. I'm going to have to hack one together myself.
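
For reference, here's roughly what that looks like on those warehouses, assuming the same users table; invalid values simply come back as nulls:

-- Snowflake
select try_cast(amount as integer) as amount from users;

-- BigQuery
select safe_cast(amount as int64) as amount from users;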

This is very solvable with Redshift UDFs, but I want to impose the constraint that the solution not involve a UDF. UDFs are annoying to install and manage; I'd prefer a solution that was pure SQL.

Here's my answer, in several steps:

1. Trim out spaces: trim(amount)

2. Convert non-integer fields to nulls via regex:

case
when amount ~ '^[0-9]+$' then amount
else null
end

3. Cast the resulting value: amount::int

Here's the whole thing put together:

case
when trim(amount) ~ '^[0-9]+$' then trim(amount)
else null
end::int as amount
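
In context, a full query might look something like this sketch (users and amount are the table and column from above; the commented regex variant is just a suggestion if your data can include negative values):

select
    case
        when trim(amount) ~ '^[0-9]+$' then trim(amount)
        else null
    end::int as amount
    -- use '^-?[0-9]+$' instead if amount can be negative
from users;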

Wrapping it up in a macro

Did you really want to think through how to solve that problem? Are you a regex expert? For me, the answers are no, and no.

That's why programming has functions: one programmer can figure out how to do something and wrap that logic up in a function so that other programmers can easily use the functionality without understanding the implementation. And if you're a dbt user, you can use and build your own SQL functions too.

I was annoyed at the amount of time I invested in this problem, so I wrote a try_cast macro for the dbt Redshift package (see it here). Now you can just write {{try_cast('amount', 'int')}} and you'll get back a safely-converted integer.
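
Inside a dbt model, that might look something like the sketch below (the model and source names here are purely illustrative):

-- models/stg_users.sql (hypothetical model)
select
    {{ try_cast('amount', 'int') }} as amount
from {{ source('app_db', 'users') }}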

What Redshift hacks do you use that should really be turned into macros? We'd love contributions :)


FAQs: Converting Strings to Integers in Redshift

Converting strings to integers in Redshift becomes challenging when you encounter invalid values in your data that prevent straightforward type casting with ::int. This happens because analysts don't control data loaded from upstream sources like Stitch and Fivetran, and a small percentage of records (typically < 0.01%) contain non-integer values that cause conversion failures.
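
Before converting, it can be worth measuring how many rows would be affected; a quick check along these lines (reusing the regex from above against the users table) counts the values that would be nulled out:

select
    count(*) as total_rows,
    -- !~ is Redshift's negated POSIX regex match
    count(case when trim(amount) !~ '^[0-9]+$' then 1 end) as non_integer_rows
from users;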

Unlike BigQuery and Snowflake, Redshift does not have a native try_cast function that would automatically handle invalid conversions by returning null. This forces developers to create custom solutions using case statements with regex validation to safely handle type conversions without throwing errors.

dbt users can leverage the try_cast macro available in the dbt Redshift package, allowing them to simply write {{try_cast('amount', 'int')}} in their SQL. This macro encapsulates the regex validation and casting logic, providing a cleaner solution that doesn't require understanding the complex implementation details.

This conversion pattern is essential for data analysts who need to transform data they don't control into proper types for analysis. It helps maintain data pipeline stability by converting invalid values to nulls rather than failing the entire query.

