snofang

Ecto.CastError Mixed Keys Issue

In typical business application code, a context module often has a function that accepts a map of attributes and passes it to Ecto to build a validated changeset:

  def create_foo(%{} = attrs) do
    attrs
    |> Foo.create_changeset()
    |> Repo.insert()
  end

These functions may be called from anywhere. Internal callers, such as tests, usually pass atom keys, while calls from Phoenix controllers and LiveViews carry string keys. So these business functions should support both key types and, thanks to Ecto.Changeset.cast, they do by default.

But what if some requirement makes it necessary to change the passed-in attributes before handing them to Ecto for the changeset or validations? Which key type should be used for the added attributes: string or atom? Consider the following function:

  def create_bar_foo(%{} = attrs) do
    attrs
    |> Map.put(:bar_case, :default_or_computed_value)
    |> Foo.create_changeset()
    |> Repo.insert()
  end

If it is called from a test with the usual atom keys, everything is fine. If it is called from a LiveView with string keys, it raises:

(Ecto.CastError) expected params to be a map with atoms or string keys, got a map with mixed keys ...

So the keys should be normalized before processing, and this article suggests some solutions. This post also addresses it to some extent.
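
As a rough illustration of that normalization step, here is a minimal sketch in plain Elixir with no Ecto dependency. The module name KeySketch and both function names are hypothetical, and it only handles top-level keys of a plain map. It converts atom keys to strings rather than the other way around, because creating atoms from external input is unsafe (atoms are never garbage collected).

```elixir
defmodule KeySketch do
  # Hypothetical helper: convert every top-level atom key to a string so the
  # map has a single key type before it reaches Ecto.Changeset.cast.
  def stringify_keys(%{} = attrs) do
    Map.new(attrs, fn
      {key, value} when is_atom(key) -> {Atom.to_string(key), value}
      {key, value} when is_binary(key) -> {key, value}
    end)
  end

  # Add the internal value with the same key type as the normalized map,
  # so the result never mixes atom and string keys.
  def put_bar_case(%{} = attrs) do
    attrs
    |> stringify_keys()
    |> Map.put("bar_case", :default_or_computed_value)
  end
end
```

With this in place, KeySketch.put_bar_case(%{name: "x"}) and KeySketch.put_bar_case(%{"name" => "x"}) both produce a map with string keys only, which cast accepts.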

There are many other solutions to this problem; for example, one can simply cast the passed attributes before manipulating them. But I think this could be handled by Ecto.Changeset.cast itself because:

Ecto.Changeset.cast already supports both atom and string keys and already touches the key type by mapping string keys to atom field names. So it sounds feasible to support a mix of both types as well.
It does no harm to existing code, since it only adds a new capability.
It encourages implementing some business logic in the context before calling Ecto directly, rather than having lots of ###_changeset functions, one per specific business case.


2023-09-21 00:04:08 UTC

Most Liked

tfwright

I think a more specific example use case might be useful here because I do think it is quite rare, but otherwise I strongly agree with other commenters that modifying params is an antipattern. In fact I would go as far as to say even if it was supported I wouldn’t make use of it. We as developers are naturally lazy and that is generally speaking a good thing as it motivates us to avoid producing spaghetti code. But here the extra code actually simplifies the design because it represents the innate complexity of the data insofar as it implicitly must deal with multiple input sources (as José says, if the param in question is system data there is no reason to cast it). When troubleshooting/debugging or even just grokking part of a program it is very useful to be able to clearly and easily trace the path of a piece of data, and if necessary, modify it in isolation. Like many things, it might seem a simple matter to recognize the “mixing”, but these things tend to multiply over time and in aggregate produce more fragile, less maintainable code.

A computed/calculated value wouldn’t necessarily call for an extra changeset, but as shown in examples above it has its own logic path which starts from the changeset, not the changeset params.

Post #7

sodapopcan

I respectfully disagree with this whole proposal.

You semi-handwaved this solution but for me this is the right answer and one of those things I’m pretty religious about in my own code. We should be converting—ie casting—our data into a known shape before doing anything with it, and this is exactly what Changeset.cast provides us. This is especially important in a dynamic language.

For me, string keys mean “untrusted”. Using this definition, general application code should rarely ever have a need to set a string key (of course there are always exceptions).

Multiple changesets are generally encouraged by the framework. “Citation needed,” yes sorry I don’t have doc or discussion links atm, the best I can give right now is to look at the User schema generated by phx_gen_auth. I actually do prefer to keep a single changeset myself when possible (it’s not a hard rule) but it results in me writing functions like maybe_assign_slug/1 which are probably more complex than they need to be if I’d just use multiple changesets.

If you really want to do these things in the context, there is no harm in manipulating changesets in the context. Changesets are kind of wild in that they have a strong presence in the web layer as well as the business layer.

Of course, you could also do stuff like this in the context:

def create_article(attrs) do
  %Article{}
  |> Article.changeset(attrs)
  |> MyApp.ChangesetHelpers.assign_slug_from(:title) # module name for illustrative purposes :D
  |> Repo.insert()
end

Finally, I think any promotion of the idea that mixing map key types is ok is a bad idea, again, especially in a dynamic language.

Post #2

josevalim

Creator of Elixir

It all depends on what is the source of the data.

If the source of the data is your own application, validations are pointless, because it makes no sense to tell the user that “bar_case is invalid” when they have no power over setting :bar_case. My reply was written from that perspective (I will clarify this in my previous message to avoid future confusion).

However, if you are manipulating other external params to compute values (and now reading back on your original thread, you said that this is indeed the case), then I agree with your concerns. In such cases, I would try to cast the initial params, then compute additional changes, and then validate the additional changes:

data
|> cast(params, ~w(foo baz)a)
|> validate_foo_and_baz()
|> compute_bar_case_as_change()
|> validate_bar()

But even this requires care. If you add a validation error to bar, and it is a computed value from foo and baz, you need to make sure the error points to the correct place.
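
A minimal sketch of the cast, compute, validate pipeline above, assuming the :ecto dependency is available. The module name FooSketch, the schemaless types, the sum used as the computed value, and the limit of 100 are all hypothetical and only serve to show an error on a computed value being reported against the input field the user can actually change.

```elixir
# Sketch only: requires the :ecto dependency in the project.
defmodule FooSketch do
  import Ecto.Changeset

  # Hypothetical schemaless types standing in for a real schema.
  @types %{foo: :integer, baz: :integer, bar_case: :integer}

  def changeset(params) do
    {%{}, @types}
    # Only external params are cast; string or atom keys both work here.
    |> cast(params, [:foo, :baz])
    |> validate_required([:foo, :baz])
    |> compute_bar_case()
    |> validate_bar_case()
  end

  # Compute the internal value only when the inputs are valid, and store it
  # as a change instead of injecting it into the params map.
  defp compute_bar_case(%Ecto.Changeset{valid?: true} = changeset) do
    foo = get_field(changeset, :foo)
    baz = get_field(changeset, :baz)
    put_change(changeset, :bar_case, foo + baz)
  end

  defp compute_bar_case(changeset), do: changeset

  # The rule is about :bar_case, but the error is attached to :foo because
  # that is a field the user has power over.
  defp validate_bar_case(changeset) do
    case get_change(changeset, :bar_case) do
      value when is_integer(value) and value > 100 ->
        add_error(changeset, :foo, "foo plus baz must not exceed 100")

      _ ->
        changeset
    end
  end
end
```

Because the computed value is added with put_change after cast, the params map is never modified and the mixed keys question does not arise.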

Post #6

Last Post!

snofang

Yup! It sounds like my view came from assuming more dependency between layers than needed (probably a habit from other ecosystems), to such an extent that I made no distinction between internal and external data. And now it is becoming clear to me how overly defensive that sounds.

Thank you all

But I still believe that the proposal itself and its workaround (the following) are valid.

snofang:

  def changeset(data, any_params, custom_internal_attrs \\ []) do
    data
    |> cast(any_params, ~w(foo bar baz)a)
    |> change(custom_internal_attrs)
    |> all_validations()
  end

p.s. by all_validations I don’t mean necessarily “all validations in one place”, but “all necessary validations”.

Post #11
