I’m migrating some functions from parallel_stream to flow.
This function loads an xlsx file and generates the data to import.
The idea is to increase overall performance. So far so good, especially with files over 50k lines.
In some tests, went from 106 seconds (parallel_stream) to 62 seconds (flow).
Any tips to increase performance further, or to do this in fewer flows?
flow pipeline
# Entry point for the import pipeline: loads the sheet at `path`, then runs the
# three flow stages — remap columns (flow_1), dedupe (flow_2), build import
# attrs (flow_3) — and returns the final list.
#
# `header`/`accepted` feed header_filter/2 to pick the columns to keep;
# `id` is the target list id as a string and is parsed to an integer here.
def flow_it(path, header, accepted, id) do
  final_header = header_filter(header, accepted)
  final_id = String.to_integer(id)

  # Status and header from the loader are deliberately ignored; only the
  # row data is carried forward.
  {_status, _header, lines} = Sheet.load_all(path)

  lines
  |> flow_1(final_header)
  |> flow_2()
  |> flow_3(final_id)
end
Step 1 — remap each row's keys to the accepted header names:
# Remaps each `{index, row}` event into a map keyed by the output header names.
#
# `final_header` is an enumerable of `{source_key, target_key}` pairs; each row
# becomes `%{target_key => row[source_key], ...}` (missing source keys map to nil).
#
# Performance fixes vs. the original:
# * This step is stateless per-row work, so `Flow.partition/1` + `Flow.reduce/3`
#   (which forces an extra exchange between stages and accumulates state) is
#   replaced with a plain `Flow.map/2` — events stream straight through.
# * The per-row map is built in one pass with `Map.new/2` instead of
#   `Enum.reduce` + a `Map.merge/2` per key.
def flow_1(enum, final_header) do
  enum
  |> Flow.from_enumerable()
  |> Flow.map(fn {_index, row} ->
    Map.new(final_header, fn {source, target} -> {target, Map.get(row, source)} end)
  end)
  |> Enum.to_list()
end
Step 2 — keep only one row per email:
# Keeps only the first row seen for each "email" value.
#
# BUG FIX: `Flow.uniq_by/2` deduplicates only *within* a partition, and the
# default `Flow.partition/1` routes events by hashing the whole event. Two rows
# with the same email but different values in other columns could therefore
# land on different partitions and both survive. Partitioning on the email
# itself guarantees all rows sharing an email reach the same partition, making
# the dedup global.
def flow_2(enum) do
  enum
  |> Flow.from_enumerable()
  |> Flow.partition(key: fn row -> row["email"] end)
  |> Flow.uniq_by(fn row -> row["email"] end)
  |> Enum.to_list()
end
Step 3 — clean each row and build the import attrs:
# Builds the import attrs map for each deduplicated row.
#
# For each row: normalizes the email via email_fix/1, extracts its domain,
# strips "email" from the row data, adds "domain", validates selected attrs,
# and wraps everything with `final_id`.
#
# NOTE(review): rows whose email does not survive email_fix/1 become `nil`
# entries in the returned list — preserved from the original behavior. If the
# caller does not filter nils, consider adding `Flow.reject(&is_nil/1)` here.
#
# As in flow_1, this is stateless per-row work, so the original
# partition + reduce (an extra exchange plus state accumulation) is replaced
# with a plain `Flow.map/2`. `Map.pop |> elem(1)` is replaced with the direct
# `Map.delete/2`.
def flow_3(enum, final_id) do
  enum
  |> Flow.from_enumerable()
  |> Flow.map(fn row ->
    case email_fix(row["email"]) do
      nil ->
        nil

      email ->
        data =
          row
          |> Map.delete("email")
          |> Map.put("domain", domain_from_email(email))
          |> Sheet.attrs_validation(["country", "phone"])

        %{
          email: email,
          enabled: true,
          list_id: final_id,
          data: data
        }
    end
  end)
  |> Enum.to_list()
end