Introducing `for let` and `for reduce`

Thanks for putting together a detailed & well-presented proposal :pray:t6:

I'm for the most part happy with it, but I'd like to share a couple of aspects that felt a bit 'off' at first encounter, and I'd love to hear what others think:

  1. The implicit asymmetry between the return value of a single iteration and the return value of the whole comprehension, as shown in the screenshot below. The fact that the first element of the tuple accumulates via mapping while the second accumulates via reduction is not explicit, and it tripped me up.

  2. The case for `for reduce` doesn't seem very compelling. AFAIK the same functionality can be accomplished with the current `:reduce` option, and it's not clear to me what advantage the new syntax brings in this case.

  3. Initialization before the comprehension: as others previously pointed out, I can see this leading to some confusion. For example: seeing `for let lesson_counter, lesson <- section["lessons"] do` somewhere in the codebase without `lesson_counter`'s initialization co-located, when it's a variable that will potentially be updated in each iteration.

  4. One of the great things about Elixir is the focus on explicitness, and I'm a bit concerned we would be giving up some ground here with some aspects of the proposed solution, e.g. the use of a tuple as the return value, with implicit reliance on position within the tuple for things like error messages. The examples used reductions over simple values like integers, but would `ComprehensionError`, for instance, still work with values like nested tuples that could end up looking like valid output?
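
For context on point 2, here is a minimal sketch of what the `:reduce` option already supports in Elixir today (summing a list, no new syntax involved):

```elixir
# The :reduce option available in current Elixir: each clause in the
# do-block receives the accumulator and returns the next accumulator.
sum =
  for i <- [1, 2, 3], reduce: 0 do
    acc -> acc + i
  end

IO.inspect(sum)
# => 6
```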

2 Likes

What if `for` "magically" scooped up re-bound `let` variables and returned them at the end, without needing to explicitly return them inside the `for`?

for let count = 0, sum = 0, i <- [1, 2, 3] do
  sum = sum + i
  count = count + 1
  i * 2
end

# returns {[2, 4, 6], %{sum: 6, count: 3}}

You can re-bind the `let` variables in each iteration, and at the end `for` scoops up the final values and returns them in a map.

To me, the original proposal, where you have to return a tuple, is destined to be confusing. Folks are used to whatever you return inside `for` getting put in a list. It's bizarre to me that you'd do that and it would gather the first part of the tuple into a list, but "discard" the second part until the final iteration.

My proposal above involves "magic" (implicit) behavior, but to me it seems less confusing than the tuple convention.

If returning a map is controversial, maybe we can return a tuple instead:

{[2, 4, 6], {6, 3}}

If we introduce `for let`, then I think we should have `for reduce` for consistency and deprecate the `:reduce` option. The proposal explains why having it at the beginning allows more possibilities, but other than that they are quite close.

The tuple contract is how it works with `map_reduce`, and that is what we are trying to mirror here. We could try to steer away from it, but then we venture further into unknown territory, which was more generally disliked in previous proposals. The comprehension error won't catch false positives, though, unless we have a type system.
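
For readers who have not used it, `Enum.map_reduce/3` is the contract in question: the per-element function returns `{mapped_value, new_acc}`, and the call as a whole returns `{mapped_list, final_acc}`:

```elixir
# Enum.map_reduce/3 maps and reduces in a single pass over the list:
# here we double each element while summing the originals.
{doubled, sum} =
  Enum.map_reduce([1, 2, 3], 0, fn i, acc -> {i * 2, acc + i} end)

IO.inspect({doubled, sum})
# => {[2, 4, 6], 6}
```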

You can re-bind the `let` variables in each iteration, and at the end `for` scoops up the final values and returns them in a map.

This was actually my first proposal but it was generally disliked. You can see the elixir-lang-core mailing list for more discussion on that. I would link but I am currently on my phone.

---

My thoughts on this topic have changed several times, but right now I am closer to `for_let` and `for_reduce` than `for let`. The reason is simple: different return types should have different functions.

10 Likes

This is great to hear, @josevalim! One of the things I was reluctant about in the proposal is that this "modal" behavior doesn't exist in other parts of the language, as many others mentioned, so it makes total sense to have different expectations about the function in this context.

However, now I understand (based on your previous comment about the special forms) that the ideal solution would be adding no new keywords. So, after reading @soup's comment, I'm wondering if something like `try let` would make sense in Elixir. Does the proposed concept of 'qualifier' work in other parts of the language?

Another thing: have you played with the idea of combining the behavior of both for-map and for-reduce in one place? Functions like `Enum.chunk_while` and `Enum.group_by`, where you control how the return values are handled, keep coming to mind when I think about the problem at hand.
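
As a concrete point of reference (from the existing API, not the proposal), `Enum.chunk_while/4` already lets the caller control how each step's value feeds into the result via `{:cont, ...}` control tuples:

```elixir
# Enum.chunk_while/4: the chunk function returns {:cont, chunk, acc}
# to emit a chunk, or {:cont, acc} to keep accumulating; the after
# function flushes whatever is left at the end.
chunks =
  Enum.chunk_while(
    1..6,
    [],
    fn i, acc ->
      if rem(i, 2) == 0 do
        {:cont, Enum.reverse([i | acc]), []}
      else
        {:cont, [i | acc]}
      end
    end,
    fn
      [] -> {:cont, []}
      acc -> {:cont, Enum.reverse(acc), []}
    end
  )

IO.inspect(chunks)
# => [[1, 2], [3, 4], [5, 6]]
```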

PS: I think the point of Phoenix already using `let` in HEEx kinda starts validating the proposal for me (out of practicality, mostly).

The tuple contract is how it works with `map_reduce`, and that is what we are trying to mirror here.

Great point :+1: Makes sense to me.

for sum := 0, count := 0, i <- [1, 2, 3] do
  sum = sum + i
  count = count + 1
  {sum, count}
end

This keeps the shape of the `for`:

Before:

  • Generators are <-
  • Filters are = or function calls, or basically anything that returns a bool value

After:

  • Initializers are :=
  • Generators are <-
  • Filters are = or function calls, or basically anything that returns a bool value

Upside:

  • ā€œShape of thingsā€ is preserved
  • Each separate ā€œthingā€ has itā€™s own operator (:=, <-)

Downside:

  • probably less readable
  • introduces a new operator that doesnā€™t exist in other places in the language

(Another downside, which many have already mentioned: this stops being an `Enum.map` and becomes either a map or a reduce depending on whether or not we have initial values.)

5 Likes

Having separate names does seem to allow for a lot of freedom. It makes it easy to document the different types of comprehensions and easy to google them when you come across them for the first time.

I do like `for` alone, this being a single unusual special form under which all this functionality groups. Maybe it's odd to have options that alter the return form, but breaking them up into `for_map` and `for_reduce` may not add a stitch of clarity to the real problem.

The initial binding is the sticky bit, because it's also controlling the result shape. How about something explicit: `for returns {:acc, {sum = 0, count = 0}}, ...` and `for returns {sum = 0, count = 0}, ...`

2 Likes

Another +1 vote for using `init` in some way (with or without parens).

For many people coming from other languages, `let` has so much baggage around mutability that it may be more of a hindrance than a help.

A couple of additional options that feel directionally Elixir-ish:

Combining `for` and `with`
While I generally disagree with including additional forms like `for_reduce` or `for_let`, there could be an opportunity to mix `for` and `with` on this occasion. My mental framework has `with` reading as something like "assuming (the given conditions succeed), do this...". Combining `for` and `with` (either syntactically via punctuation/blocks or literally as `for_with`) would read something like "assuming we're able to initialize these variables, then execute this comprehension".

Leveraging guard-like syntax
We already have a way to say "perform this secondary check/action when doing this logic block": the `when` syntax does this with guards. Why not have something similar for comprehensions? Could also combine the above `with` logic, or use an aforementioned option as well (`init` remains my personal favorite):

for i <- [1,2,3] with {sum, count} <- {0,0} do...
for i <- [1,2,3] init {sum, count} <- {0,0} do...
for i <- [1,2,3] let {sum, count} <- {0,0} do...
for i <- [1,2,3] first {sum, count} <- {0,0} do...

Yes, there is some overloading here, but it's not so foreign.

1 Like

This is very interesting! I like how it keeps everything under the `for` namespace of special forms, plus it makes the return value very clear (and even customizable: if you want the "reduce" part of `map_reduce` before the "map" part, you can do that). It also leaves open future extensions, such as `async`, or even filters on accumulated values before generators are specified.
The one difference I suggest is that `:map` (or `:values`?) be used where you used `:acc`, since `sum` and `count` are accumulation variables (the reduce part of `map_reduce`).

I think the idea is good, but it suffers from the same "problems" as `for let` and `for reduce`: syntax that isn't available outside of this scope. If specifying the 'qualifiers' at the end were feasible, I'd rather leave it like the existing `:into` and `:reduce` options. Also, the similarities with the guard syntax seem only superficial to me.

This is also interesting, but if that were the case, I'd prefer if we could retain compatibility with the `Enum.chunk_while/4` return structure or something similar (by specifying how the results are going to be handled). However, I don't think supporting distinct result types in `for` would be a good idea if we care about ergonomics, so I'd rather have another keyword in that case (different expectations for different functions).

I think the idea is good, but it suffers from the same "problems" as `for let` and `for reduce`: syntax that isn't available outside of this scope.

There's no reason why an `init` "guard" couldn't be used in other scopes. I could certainly see it being useful outside of comprehensions.

I'll refrain from responding to the "superficial" statement.

1 Like

Just to expand on this a little more: I personally don't think that the correlation would be easier for beginners, because guard syntax is already well-defined in Elixir via the `when` keyword. Also, it is not usual to use guards with functions that don't return a boolean value (if I'm not mistaken, but I might be wrong on this). So it seems that instead of "similar to guards", it's just 'qualifiers' in a different position.

Could you elaborate on other use cases you've thought would be useful (similar to the proposed usage, at least)? One of the concerns brought up in previous discussions was that 'qualifiers' were strange and specific to `for`, and not commonly seen in other parts of the language, like I said.

Guards are a qualifier in the position that I'm referencing. That's the analog.

Could you elaborate on other use cases you've thought would be useful (similar to the proposed usage, at least)?

One example: when piping into conditionals like `case`/`cond`, it could be nice to have additional data there with the conditional itself, rather than "init-ing" values above the pipe chain. It could possibly also be used in the function head as a more explicit default for recursive functions.

That's something I've thought would be useful to add to `for` comprehensions (I actually hacked on a macro for this recently). In my mind it makes comprehensions more composable: you can pass the comprehension streams into additional comprehensions or `Enum`/`Stream` functions without many traversals of your list. I see stream comprehensions as the `Ecto.Query.from/2` macro, if the `Stream` module is the rest of `Ecto.Query` (hopefully that makes sense).

I think in this discussion folks are more looking for a way to reduce without leaving `for`, but I like the idea of being able to mix and match.

1 Like

Agreed on this a lot! I particularly don't like how Ecto's `Repo.transaction` changes behaviour based on whether you're passing a function or a `Multi`. Unless there's something I'm missing, there should be a `Multi.transaction(multi)` instead of `Repo.transaction(multi)` (or maybe `Multi.run` to mirror `Stream.run`, except that `Multi.run` is already a thing).

I'd prefer `for let`, and even `forlet`, instead of `for_let`. While those are all macros or special forms, there are some constructs that are treated as "keywords" (`defmodule`, `for`, `if`, etc.). Somehow `for_let` doesn't fit in there.

But another approach to solving this would be to declare the shape of the returned value in the `for`'s "header". Also, thinking about this more, "reduce" is not friendly to people coming from imperative languages. I think talking about the "returned value" is way more familiar, so I'm also proposing to get rid of `reduce` altogether in favour of `return`.

So a map-reduce would be:

for return {sum = 0, count = 0, i <- [1, 2, 3]} do
  sum = sum + i
  count = count + 1
  {sum, count, i}
end

and reduce:

for return {sum = 0, count = 0}, i <- [1, 2, 3] do
  sum = sum + i
  count = count + 1
  {sum, count}
end

3 Likes

I have considered this route, but I can't come up with any reasonable syntax. Putting the generator as part of the return type feels incorrect; the generator is not really part of the result, and you can have multiple generators, which would not make sense either. We would need a way to refer to the output, but all of them would feel magical.

2 Likes

I think that the following would make sense, except that I'm not sure what should be done with pattern matching.

for i <- [1, 2, 3], return {sum = 0, count = 0, i} do
  sum = sum + i
  count = count + 1
  {sum, count, i}
end

Flipping the order would maybe solve that by "enforcing" a bind to a variable:

for return {sum = 0, count = 0, i}, <some pattern> = i <- [1, 2, 3] do
  sum = sum + i
  count = count + 1
  {sum, count, i}
end

Did you consider a `return`-like macro inside the block? I know that some options have been dismissed on the mailing list due to "refactorability" of the code, but to me this seems similar to rolling back Ecto transactions: there is some minimal amount of plumbing/wiring needed:

Repo.transaction(fn ->
  # ...
  # do stuff building up `result`, but at some point:
  result
  |> case do
    {:ok, foo} -> foo
    {:error, reason} -> Repo.rollback(reason)
  end
end)

So how about something like:

for i <- [1, 2, 3], sum = 0, count = 0 do
  sum = sum + i
  count = count + 1
  continue {i, sum, count}
end

I guess I'm starting to lean towards something like `after` from the initial proposal, but maybe more of a single expression than imperative rebindings.

EDIT: `after` would work too, but `continue` would be best for people from an imperative background:

for i <- [1, 2, 3], sum = 0, count = 0 do
  sum = sum + i
  count = count + 1
after
  {i, sum, count}
end

If we had `continue`, I'd love to see `break` for early exits.

I feel like `for` is already an odd construct, because it behaves like an `Enum.map` (with nesting and filtering). Ideally, I think you'd want generators to be more integrated with the standard library, but that's hard to do right now.
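
To make the `Enum.map`-like behavior concrete, a comprehension with multiple generators and a filter behaves like nested `Enum.flat_map`/`Enum.filter`/`Enum.map` calls (this is a sketch of the observable equivalence, not of how `for` is actually compiled):

```elixir
# A comprehension with two generators and a filter...
pairs_for =
  for x <- [1, 2, 3], y <- [4, 5, 6], rem(x + y, 2) == 0, do: {x, y}

# ...produces the same result as nested flat_map/filter/map calls:
pairs_enum =
  Enum.flat_map([1, 2, 3], fn x ->
    [4, 5, 6]
    |> Enum.filter(fn y -> rem(x + y, 2) == 0 end)
    |> Enum.map(fn y -> {x, y} end)
  end)

IO.inspect(pairs_for == pairs_enum)
# => true
```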

To avoid confusion, I would let the `for` construct mirror the functions we know as much as possible, instead of making up new terms for the same functions we already have in `Enum`.

Something like this (`map` can be the default unless specified):

for map x <- [1,2,3], y <- [4,5,6] do
  {x, y}
end

for map_reduce acc = 0, x <- [1,2,3], y <- [4,5,6] do
  {{x, y}, acc + x + y}
end

for reduce acc = 0, x <- [1,2,3], y <- [4,5,6] do
  acc + x + y
end

This will also leave some room for possible future expansion for any other enumerable functions.

20 Likes

This is a neat idea, I like how explicit and intuitive to understand it is!