Very high memory usage when streaming file with Phoenix

Hello, I have discovered Elixir recently and I have decided to do some projects with it (and Phoenix).

I need to take file from one server and stream it to client through my server. I wanted to do it with streams. My HttpStream looks almost exactly as this: https://labs.civilcode.io/elixir/2018/05/03/stream-files.html.

Streaming works but when I want to access bigger file (for example ~4 GB movie) my application consumes large amounts of memory, and after ~20 seconds video stops playing.

My controller’s function looks like this:

def show(conn, _params) do
    url = "http://localhost:8080/Bigfile.mp4"
    %{headers: proxy_headers} = HTTPoison.head!(url)
    proxy_headers = for {k, v} <- proxy_headers, into: %{}, do: {String.downcase(k), v}

    chunked_conn =
      conn
      |> put_resp_content_type("video/mp4")
      |> put_resp_header("content-length", proxy_headers["content-length"])
      |> send_chunked(200)

    url
    |> HttpStream.stream()
    |> Stream.map(fn n -> chunk(chunked_conn, n) end)
    |> Stream.run()
  end

It’s not beautiful but I’ll improve that later. From my observations chunk/2 is called too fast (if I insert :timer.sleep(1) before stream_next(resp), memory usage is normal, but it’s much slower).
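For reference, the linked article builds HttpStream around Stream.resource/3 and HTTPoison’s async mode, where the next chunk is only requested (via HTTPoison.stream_next/1) after the previous one has been consumed. Here is a dependency-free sketch of that pull pattern; the producer process below is a made-up stand-in for hackney, not HTTPoison’s real API:

```elixir
# Pull-based streaming: the consumer asks the producer for each chunk,
# so at most one chunk is in flight at a time.
defmodule PullStream do
  # Hypothetical stand-in for an HTTP client: a process that sends one
  # chunk per {:next, from} request, then :done when exhausted.
  def start_producer(chunks) do
    spawn(fn -> loop(chunks) end)
  end

  defp loop([]) do
    receive do
      {:next, from} -> send(from, :done)
    end
  end

  defp loop([chunk | rest]) do
    receive do
      {:next, from} ->
        send(from, {:chunk, chunk})
        loop(rest)
    end
  end

  # Stream.resource/3 only sends {:next, ...} when the consumer demands
  # another element -- the same shape HttpStream uses with stream_next/1.
  def stream(producer) do
    Stream.resource(
      fn -> producer end,
      fn producer ->
        send(producer, {:next, self()})

        receive do
          {:chunk, chunk} -> {[chunk], producer}
          :done -> {:halt, producer}
        end
      end,
      fn _producer -> :ok end
    )
  end
end

chunks =
  ["part1", "part2", "part3"]
  |> PullStream.start_producer()
  |> PullStream.stream()
  |> Enum.to_list()

IO.inspect(chunks)  # ["part1", "part2", "part3"]
```

The key property is that the producer never runs ahead of the consumer, which is the back pressure the real pipeline needs somewhere.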

This is the memory usage from :observer (the images are hosted on Imgur and appear cropped here):
[Imgur screenshot: memory usage graph from :observer]

And a very interesting thing is that one process has a very long message queue:
[Imgur screenshot: process with a very long message queue]

How can I fix this?


Only from roughly reading your post, not the linked code… But it seems as if data comes in faster than you are able/willing to process it.

Perhaps you need some rate limiting on the reading end? But you’ll probably need to change a lot of code to do so.

My bad, here is the code: https://github.com/Arquanite/phoenix-streaming-app

I hope there is a way to request more data from the original server only after the previous data has been sent to the client (normal HTTP servers must work in a similar way?).

╰─➤  /usr/bin/time -f "mem=%K RSS=%M elapsed=%E cpu.sys=%S user=%U" -- mix test 
..

Finished in 0.08 seconds
2 tests, 0 failures

Randomized with seed 92121
mem=0 RSS=52428 elapsed=0:01.09 cpu.sys=0.28 user=1.10

I’m running your tests but I’m not seeing high memory usage here. What’s the specific test to run that shows the problem?

I’m still thinking the issue is just what @NobbZ said: the data is being received faster than it can be sent back out, and nothing is throttling it.

There are no tests for this (I don’t have much experience with testing and don’t know how to test this without uploading a really big binary somewhere).

I’m testing it this way:
http-server (from npm) is listening on port 8080, serving my video file named “Bigfile.mp4”. Then I open localhost:4000/api/files/1 in my browser (the id doesn’t matter) and it starts loading the video.

As you can see there is a really small amount of code, so I could replace it with something more efficient. But I don’t know how.

Ah, no problem then. And to get a ‘big infinite set of data’ you can just read /dev/random or so. ^.^

I took a look and it seems you are using HTTPoison, which uses hackney underneath, and as I recall hackney does use active: :once on the TCP socket when you pass active: :once to HTTPoison, so that should be fine… Maybe it’s memory growing somewhere rather than something actually being stored…

At this point I’d really use :observer to see which process is allocating that memory, then run a GC on that process. If that lowers the memory, then it’s just unused memory that hasn’t put enough pressure on the system to trigger a GC in its time yet (and there are a few fixes for this, but eh). If it’s actually live memory though, then something is holding on to it, which from your code could be hackney, HTTPoison, or Stream, and I doubt it’s Stream. Let’s look at the consumer perhaps…
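Running that GC by hand doesn’t need :observer, by the way. Here’s a self-contained way to test the “collectible garbage vs. live data” theory on any pid (the allocation amounts below are arbitrary, just to dirty the heap):

```elixir
# Force a GC on a process and compare its memory before and after.
# If the number drops a lot, the memory was collectible garbage;
# if it stays flat, something is actually holding references.
pid =
  spawn(fn ->
    # Churn through allocations without keeping references,
    # leaving garbage on the heap, then idle.
    Enum.each(1..100_000, fn i -> Integer.to_string(i * i) end)

    receive do
      :stop -> :ok
    end
  end)

# Give the process a moment to finish allocating.
Process.sleep(100)

{:memory, before_gc} = Process.info(pid, :memory)
:erlang.garbage_collect(pid)
{:memory, after_gc} = Process.info(pid, :memory)

IO.puts("before: #{before_gc} bytes, after GC: #{after_gc} bytes")
send(pid, :stop)
```

The same two calls work from iex against the Cowboy connection process once you have its pid from :observer.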

Hmm, in your controller:

    chunked_conn =
      conn
      |> put_resp_content_type("video/mp4")
      |> put_resp_header("content-length", proxy_headers["content-length"])
      |> send_chunked(200)

    url
    |> HttpStream.stream()
    |> Stream.map(fn n -> chunk(chunked_conn, n) end)
    |> Stream.run()
  end

Hmm, as I recall the response body gets accumulated in conn.resp_body, but that’s not being accumulated here. I do know that conn can be used as an ‘into’, so the whole streaming part could be replaced with:

    url
    |> HttpStream.stream()
    |> Stream.into(chunked_conn)
    |> Stream.run()

However I think that might accumulate the body, not sure…

I haven’t used chunks in plug yet outside of trivial things… Hmm…

Oh wait!!!

It’s Stream.map! It’s keeping all of the past sent data!

Just replace Stream.map in your existing code with Stream.each so it doesn’t save the results. (Stream.into might work too, if it doesn’t accumulate; I’m not sure.)

HTTPoison is doing a good job; when I replace chunk with other code the memory usage is fine.
I used :observer and it shows which process is consuming the memory (the screenshot is in the first post). It’s something related to Cowboy. I’m really a beginner, but I think the problem is this long message queue (and a GC won’t help with that?).

And Stream.each does not improve anything :<

Well, you definitely want each instead of map there, as map would have kept all of the past sent data, so… ^.^;

If ‘each’ doesn’t help, maybe try into(chunked_conn)? I’m really surprised each didn’t fix it…

Still nothing, it loads the whole file into memory (it uses 4.6 GB of RAM, almost exactly the size of my video).

EDIT: Maybe I should use something other than conn, but what, and how would I integrate it into a Phoenix app?


You should change your controller function to return conn. Right now it returns whatever Stream.run/1 returns. So change that part of your code to:

    conn =
      conn
      |> put_resp_content_type("video/mp4")
      |> put_resp_header("content-length", proxy_headers["content-length"])
      |> send_chunked(200)

    url
    |> HttpStream.stream()
    |> Stream.each(fn n -> chunk(conn, n) end)
    |> Stream.run()

    conn

Not sure it will help with the memory problem, but every controller function needs to return a Plug.Conn.t().


Nice, I accidentally deleted my reply, and the forum says “You’ve performed this action too many times. Please wait 23 hours before trying again.” when I try to undelete it. I’m pretty sure I haven’t undeleted anything before, and the post will be gone in 24 hours :slight_smile:

Here are those links again



I’ve implemented this like in the linked issue and now it works.


I think that’s a temporary solution, because if you have multiple streaming connections that mailbox might be filling from all of them, and checking the mailbox size might not be a very good solution. Hopefully this gets fixed in Cowboy 2.7. I read that Cowboy 1.x isn’t affected because chunk is a synchronous function in 1.x. Maybe you could also try downgrading to 1.x in the meantime, if that’s possible with Phoenix.
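For anyone curious what that workaround boils down to: it polls the connection process’s message queue length and pauses the producer while the queue is backed up. The queue length is just Process.info/2; the module name and threshold below are made up for illustration:

```elixir
# Crude back pressure: wait until a process's mailbox drains below a
# limit before sending it more work. This is the shape of the
# mailbox-size workaround discussed above.
defmodule MailboxThrottle do
  @limit 100

  def wait_until_drained(pid) do
    case Process.info(pid, :message_queue_len) do
      {:message_queue_len, n} when n > @limit ->
        Process.sleep(10)
        wait_until_drained(pid)

      _ ->
        :ok
    end
  end
end

# Demo: flood a sleeping process, then inspect its queue length.
pid = spawn(fn -> Process.sleep(:infinity) end)
Enum.each(1..500, fn i -> send(pid, {:chunk, i}) end)

{:message_queue_len, n} = Process.info(pid, :message_queue_len)
IO.puts("queue length: #{n}")  # 500
```

As noted, with several concurrent downloads all filling the same connection process this heuristic gets shaky, which is why a fix in Cowboy itself is preferable.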


Once people start with streams it’s common to think they need to keep using a stream. The code should be:

    url
    |> HttpStream.stream()
    |> Enum.reduce(conn, fn n, conn ->
      {:ok, conn} = chunk(conn, n)
      conn
    end)

This ensures that the controller action returns a conn, and it returns specifically the conn that has sent all the chunks, which is what you want. It also avoids accumulating all the data in memory.

HttpStream still needs to be implemented in a way that doesn’t overload the process with messages though.


It seems that you didn’t read this thread before replying. The problem doesn’t have anything to do with that code; there is a bug in Cowboy 2.x that is causing this. Look at my earlier post with links. Calling chunk pushes a message into cowboy_clear:connection_process’s mailbox and returns. That means there is no back pressure like in Cowboy 1.x, where the call to chunk was synchronous. The mailbox keeps filling up because the file chunks are coming in fast, the cowboy_clear:connection_process process gets slow, and a huge amount of memory is allocated.

If you want to try it yourself, @no_one provided the code on GitHub a few posts back.

Also, are you sure it’s OK to use Enum.reduce with streams? I’m pretty sure that will load the whole file into memory because it will create an intermediate list, as you can see in the examples here: https://hexdocs.pm/elixir/Stream.html

Edited: I think I was wrong; using Enum.reduce doesn’t cause any intermediate list to be created because it returns a single value, so it’s probably OK in that code.
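That matches the docs: Enum.reduce/3 is a terminal operation that pulls elements from the stream one at a time and only keeps the accumulator, so nothing is materialized. A quick check with a per-element side effect (the strings and byte counts here are arbitrary; the accumulator plays the role the conn plays in the controller):

```elixir
# Enum.reduce consumes a lazy stream element by element; only the
# accumulator (here a running byte count) is retained between steps.
total_bytes =
  ["aa", "bbb", "c"]
  |> Stream.map(fn chunk ->
    # Runs lazily, once per element, as reduce demands it.
    IO.puts("producing #{byte_size(chunk)} bytes")
    chunk
  end)
  |> Enum.reduce(0, fn chunk, acc -> acc + byte_size(chunk) end)

IO.inspect(total_bytes)  # 6
```

So the memory blowup in this thread comes from the Cowboy mailbox, not from reducing over the stream.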


I tried your code and downgraded {:plug_cowboy, "~> 2.0"} to {:plug_cowboy, "~> 1.0"} in deps, and it seems to fix the problem; memory doesn’t increase anymore.
