darkmarmot

Strange behavior with SHA vs phash2

We have some code that creates unique hashes on documents to handle duplicate checks.

It wasn’t working on nodes that had rebooted (duplicates were getting through). As a last resort I switched from SHA to phash2, and now it works perfectly. This seems really strange to me. Is there some kind of timestamp seed or something in the crypto SHA that I’m not aware of?

Here’s the code change that fixed our issues:

hash =
  :erlang.phash2(doc, 1_000_000)

Here’s the original that is not working across restarts:

hash =
  :crypto.hash(:sha, :erlang.term_to_binary(doc)) |> Base.encode64(case: :lower)

Maybe term_to_binary or encode64… but… it just surprised me a lot…


#crypto #hash


2018-08-09 02:12:23 UTC

Most Liked

cmkarlsson

Sorry for a few follow-up questions. I am not even sure I can help here.

Can you consistently reproduce this? Does the same document always produce a different result after a node reboots?
What is doc? Is it a binary string or an Erlang term? Something else?
Are the initial hash and the hash after the reboot computed on the same node?
If they are computed on different nodes, are those nodes running on different server architectures?
Are you able to try the different parts of the SHA hash pipeline individually to see which one is not working?
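
The last question above can be sketched as a small diagnostic that exposes every intermediate value of the original pipeline, so the output before and after a restart can be compared stage by stage. This is only a sketch: `HashDebug` and `stages/1` are hypothetical names, and the sample `doc` is made up because the thread never shows the real document.

```elixir
defmodule HashDebug do
  # Hypothetical helper (not from the thread): returns each intermediate
  # value of the SHA pipeline, plus the phash2 value, for one document.
  def stages(doc) do
    binary = :erlang.term_to_binary(doc)
    digest = :crypto.hash(:sha, binary)

    %{
      # Hex dump of the external term format; if this differs across
      # restarts, term_to_binary is the unstable stage.
      term_binary_hex: Base.encode16(binary, case: :lower),
      sha_hex: Base.encode16(digest, case: :lower),
      sha_base64: Base.encode64(digest),
      phash2: :erlang.phash2(doc, 1_000_000)
    }
  end
end

# Made-up sample document, for illustration only.
doc = %{id: 1, title: "example"}
IO.inspect(HashDebug.stages(doc), label: "stages")
```

Logging this map on the node before and after a reboot should show which stage first diverges. As far as I know, `Base.encode64/2` only reads the `:padding` option, so the `case: :lower` in the original snippet is most likely ignored and unlikely to be the cause.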

My bet is that it is term_to_binary. I am not sure it is meant to always produce the same result, only that it can be decoded with binary_to_term.

phash2, on the other hand, is designed to produce the same result for the same term regardless of Erlang version and machine architecture.
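
If term_to_binary does turn out to be the unstable stage, one possible way to keep SHA is to request a deterministic encoding. The sketch below assumes OTP 24.1 or later, where `:erlang.term_to_binary/2` accepts the `:deterministic` option; `DocHash` is a hypothetical module name. The Erlang documentation only promises stability within one major OTP release, and a document that contains pids, references, or funs may still encode differently after a restart, so this is not a confirmed fix for the problem in this thread.

```elixir
defmodule DocHash do
  # Sketch under assumptions: requires OTP 24.1+ for :deterministic.
  # Hashes the deterministic external term encoding with SHA-1 (:sha),
  # matching the algorithm used in the original post.
  def hash(doc) do
    binary = :erlang.term_to_binary(doc, [:deterministic])

    :sha
    |> :crypto.hash(binary)
    |> Base.encode16(case: :lower)
  end
end

# Made-up sample document, for illustration only.
IO.puts(DocHash.hash(%{id: 1, title: "example"}))
```

If the hash has to survive OTP upgrades as well, hashing a serialization whose format the application controls (for example, a sorted list of the fields that define a duplicate) is probably safer than relying on the external term format.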

Post #2

benwilson512

Author of Craft GraphQL APIs in Elixir with Absinthe

The collision chance for phash2 is gonna be a lot higher than a sha though, so I’d definitely recommend finding a way to use a sha.
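
To put a rough number on that, the birthday-bound approximation P ≈ 1 - exp(-n(n-1) / 2N) estimates the chance of at least one collision among n documents hashed into N possible values. This is a sketch assuming uniformly distributed hashes; `Collision` is a hypothetical module name, N = 1_000_000 comes from the phash2 range in the question, and N = 2^160 is the SHA-1 output space.

```elixir
defmodule Collision do
  # Birthday-bound approximation of the probability that at least two of
  # `n` uniformly distributed hashes collide in a space of `space` values.
  def probability(n, space) do
    1.0 - :math.exp(-n * (n - 1) / (2 * space))
  end
end

# phash2 limited to 1_000_000 buckets, as in the question.
IO.inspect(Collision.probability(1_000, 1_000_000), label: "phash2, 1_000 docs")

# SHA-1 has 2^160 possible digests.
IO.inspect(Collision.probability(1_000, :math.pow(2, 160)), label: "sha, 1_000 docs")
```

With only 1,000 documents the phash2 estimate is already about 39%, which in a duplicate check would show up as distinct documents being rejected as dupes, while the SHA estimate is indistinguishable from zero at floating-point precision.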

Post #3
