Stream consumes much more memory

Hello everybody,

I’m creating an HTTP wrapper to download files as a Stream. Currently, it works with :httpc, :hackney and :ibrowse and has the following functions:

  • stream/2, which creates an Elixir Stream.
  • read/2, which gets the content and returns it as a string.
  • download/2, which downloads to a file.

I realized that I can use just stream/2 to implement the other two functions. This would simplify the library code a lot, but before making the change I wanted to benchmark it. I’m surprised by the results of a simple test with a 1 MB file. If I use the stream/2 function, memory consumption is much higher (at the same speed):

$  mix run benchs/download.exs
...

Memory usage statistics:

Name                     average  deviation         median         99th %
download                 3.18 KB     ±1.00%        3.20 KB        3.20 KB
stream                  32.58 KB     ±0.40%       32.62 KB       32.67 KB
stream_file_open        41.06 KB     ±0.44%       41.12 KB       41.17 KB

Comparison:
download                 3.20 KB
stream                  32.62 KB - 10.21x memory usage
stream_file_open        41.12 KB - 12.87x memory usage

$ mix run benchs/read.exs
...

Memory usage statistics:

Name           Memory usage
read_stream        26.39 KB
read                3.29 KB - 0.12x memory usage

The read benchmark is something like this:

read = fn -> {:ok, r} = Down.read(url, backend: :httpc); r end
read_stream = fn ->
  url
  |> Down.stream(backend: :httpc)
  |> Enum.into([])
  |> IO.iodata_to_binary()
end

The Down.read/2 function internally uses a strategy very similar to the read_stream function: every time it receives a new chunk it appends it to a list, and when it finishes, it calls the same IO.iodata_to_binary/1.
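The accumulation described above can be sketched like this (a minimal illustration, not the actual library code; the hard-coded chunks stand in for data received from the HTTP backend):

```elixir
# Sketch: accumulate chunks as iodata, then flatten once at the end.
chunks = ["Hello", ", ", "world"]

binary =
  chunks
  # [acc | [chunk]] nests the accumulator, which is valid iodata,
  # so each append is O(1) instead of copying the whole list.
  |> Enum.reduce([], fn chunk, acc -> [acc | [chunk]] end)
  |> IO.iodata_to_binary()

# binary == "Hello, world"
```

Building iodata and converting once at the end avoids repeatedly copying a growing binary.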

With the download function, the situation is very similar: instead of inserting chunks into a list, I just write them to a file.

The Down.stream/2 function just sends the received chunks to the calling PID and wraps them with Stream.resource/3.
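A sketch of that pattern (an assumption on my part, not Down’s actual implementation): a Stream.resource/3 stream that emits chunks delivered to the process mailbox as `{:chunk, data}` messages:

```elixir
# Sketch: turn messages arriving in this process's mailbox into a Stream.
stream =
  Stream.resource(
    fn -> :ok end,                       # start_fun: nothing to set up here
    fn acc ->
      receive do
        {:chunk, data} -> {[data], acc}  # emit one chunk downstream
        :done -> {:halt, acc}            # end of the download
      end
    end,
    fn _acc -> :ok end                   # after_fun: cleanup would go here
  )

# Usage: feed the stream from this process's own mailbox.
send(self(), {:chunk, "abc"})
send(self(), :done)
result = Enum.to_list(stream)
# result == ["abc"]
```

Each chunk crosses the mailbox as a message, which is one plausible source of extra allocation compared with writing straight to a file.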

The full code, including the benchmarks, can be found at https://github.com/alexcastano/down
I don’t use any buffer or any caching system.

If someone wants to run some tests, I use the following Docker command to get an HTTP test server:

$ docker run --name httpbin --restart=unless-stopped -p 6080:80 -d kennethreitz/httpbin

I cannot find an explanation for this at the moment. So my questions are:

  • Do you know why this is happening?
  • Should I be worried about this, or is 33 KB acceptable? Of course, 3 KB would be nicer :slight_smile:
  • Are Benchee memory benchmarks accurate?

Thanks in advance.


Stream uses function thunks, which use more memory, and there is more passing of data around, so the GC needs to run more often. However, it doesn’t run as often as it could; it can reclaim everything en masse later, so it is using available RAM to make the operation faster. Remember, only use Stream when you need truly unbounded (or unknown-bound) operations, or when your overall structure exceeds available memory; otherwise stick with eager constructs. :slight_smile:

And yeah, 33 KB is basically nothing for that kind of stuff.


Thank you for your response.

Ok, this makes sense :slight_smile:

I think an HTTP download is a good use case for Stream, don’t you?

In fact, one of the main features is to stop the download after a maximum size, because the remote server can:

  • send an invalid size header
  • not send the file size at all
  • send junk data trying to fill our RAM or our disk
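Enforcing such a maximum size on top of a stream could look roughly like this (a sketch with Stream.transform/3; the chunk list and the 10-byte limit are made up for the example, and this is not necessarily how Down does it):

```elixir
max_size = 10  # bytes; tiny on purpose for the example

limited =
  ["aaaa", "bbbb", "cccc"]             # stands in for Down.stream(url)
  |> Stream.transform(0, fn chunk, seen ->
    seen = seen + byte_size(chunk)

    if seen > max_size do
      {:halt, seen}                    # stop the download past the limit
    else
      {[chunk], seen}
    end
  end)
  |> Enum.to_list()

# limited == ["aaaa", "bbbb"]  (the third chunk would exceed 10 bytes)
```

Because the accumulator tracks bytes actually received, this defends against both a lying Content-Length header and a missing one.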

Ok, I agreed with this until I found a big problem. I did the same test with a 300 MB file, and these are the results:

Benchmarking download...
Benchmarking stream...

Name               ips        average  deviation         median         99th %
stream            0.30         3.36 s     ±1.98%         3.36 s         3.40 s
download          0.29         3.46 s     ±5.99%         3.46 s         3.60 s

Comparison:
stream            0.30
download          0.29 - 1.03x slower

Memory usage statistics:

Name        Memory usage
stream          72.34 MB
download      0.00397 MB - 0.00x memory usage

So, when the library writes to the file, memory usage is a constant 4 KB. When I delegate to Stream, it depends on the file size. It’s strange, isn’t it?

I’d like to investigate this further, but I don’t know how. I don’t have enough knowledge about the Erlang VM or its debugging tools.
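One place to start (a suggestion, not something from the thread): :erlang.process_info/2 can report a process’s total memory and the off-heap (refc) binaries it currently references, which is useful for spotting binary buildup in a streaming pipeline:

```elixir
# Total memory (in bytes) of the current process: heap, stack, and so on.
{:memory, mem} = :erlang.process_info(self(), :memory)
IO.puts("process memory: #{mem} bytes")

# Refc binaries (those larger than ~64 bytes) referenced by this process.
{:binary, bins} = :erlang.process_info(self(), :binary)

# Allocate a 1 MB binary and look again: the list of referenced binaries grows.
_big = :binary.copy("x", 1_000_000)
{:binary, bins_after} = :erlang.process_info(self(), :binary)

length(bins_after) > length(bins)
```

Running this inside the process that consumes the stream, before and after Stream.run/1, would show whether chunk binaries are piling up until the next GC.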

Thank you!

Depends, can you use the data piecemeal or do you need all of the returned data en masse to work on it?

That all depends on the download API being used then, which one are you using?

I bet it’s storing to a binary and not being processed anywhere. Binaries are extremely efficient, especially across actors, as they have their own global heap storage when above 64 bytes or so in size (as well as fast appending if there is only one owner, etc.). A stream will end up allocating a whole ton of binaries! But if your download API can write straight to a file, then it can use the base BEAM calls, which use kernel calls to pass the socket data straight over to the kernel, so no real allocations need to be done. I’d need to see the code to tell what all is being done.

Well, I’m trying to build a generic library to stream HTTP requests. So the main reason to use it is that you can consume the data piece by piece. Another reason could be to guard against attacks with huge files.

I’m not sure I follow you here. The library can use the streaming options of :hackney, :ibrowse and :httpc. It checks the size of every chunk it receives; when the sum of the chunk sizes exceeds the given limit, it stops the download. It also checks the size header.

Exactly, you’re right. When the library receives a new chunk and writes it straight to the file, it only consumes 4 KB. When I use a stream like this:

file = Temp.path!() |> File.open!([:write, :delayed])

url
|> Down.stream(backend: :httpc)
|> Stream.each(fn c -> IO.binwrite(file, c) end)
|> Stream.run()

File.close(file)

or like this:

file_stream = Temp.path!() |> File.stream!([:write, :delayed])

url
|> Down.stream(backend: :httpc)
|> Stream.into(file_stream)
|> Stream.run()

is when it consumes so much memory.

It is open source: https://github.com/alexcastano/down/blob/master/benchs/download.exs

But of course, I know it’s too much to ask you to take a look :slight_smile:

Any advice on how to debug where the memory is being consumed would be great.

Thank you for your time
