demorgan

Fetching tens of thousands of files regularly

Long-time lurker here.

Some background: I’ve been learning and playing with Elixir for about a year now. I did a few simpler projects with Phoenix, but now I wish to try something a bit harder.

As the title suggests, one part of the system I’m trying to build will regularly fetch thousands of files, process them in some way, and then save any relevant information.

I think I have most of the parts figured out. There are plenty of good resources on building APIs + DB and making that handle lots of requests from the outside.

What is the best way to handle this, though, when the application itself generates that amount of work? For the processing I’m looking at a Rust NIF, which looks promising, and I’m curious to play around with it. My question is more about how to handle the logic around a large (and, let’s imagine, increasing) volume of work. I’ve looked at GenStage, but I’m not sure I understand it well enough to imagine how it would work. What are the tradeoffs in Elixir between getting the work done fast and making sure the app doesn’t crash itself?

Another thing I’m trying to wrap my head around is how to write this so that the code itself doesn’t limit the application. A worker pool might work, but what if the app is hosted on GCP or AWS and the machine is scaled vertically? Would the app fail to scale without changing the code?

How can I exploit the concurrency here optimally? How does one determine the boundaries in such cases and then make sure they are respected?

I’m not sure I’m explaining well enough here, that’s probably due to my level of understanding =)

22 comments

#concurrency

23 2950 22

2020-03-13 23:09:28 UTC

Most Liked

dimitarvp

General advice: stay away from NIFs for as long as humanly possible.

Even if your processing code ends up being slow and inefficient, stick with it for a while. Don’t introduce much more complexity until you’ve shaped a working solution.

I’d recommend GenStage and Broadway generally, but to keep things simple I’d much more readily recommend you just look at the docs of Task.async_stream (the 3-argument and 5-argument forms).
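To make the Task.async_stream suggestion concrete, here is a minimal sketch, not a definitive implementation. The module name `FetchSketch`, the `fetch_fun` argument (a stand-in for whatever HTTP client ends up being used), the timeout, and the `* 4` multiplier are all assumptions made for illustration. The concurrency limit is derived from `System.schedulers_online/0`, so it follows the machine size when the host is scaled vertically.

```elixir
defmodule FetchSketch do
  # Hypothetical module, for illustration only.
  #
  # `fetch_fun` is an assumed 1-arity function that takes a URL and returns
  # {:ok, body} or {:error, reason}. It should return tagged tuples rather
  # than raise, because the tasks are linked to the calling process.
  def run(urls, fetch_fun, opts \\ []) do
    # Task.async_stream defaults max_concurrency to System.schedulers_online().
    # Fetching is mostly I/O-bound, so a multiple is used here; the factor 4
    # is an arbitrary starting point and should be measured, not trusted.
    max_concurrency =
      Keyword.get(opts, :max_concurrency, System.schedulers_online() * 4)

    timeout = Keyword.get(opts, :timeout, 30_000)

    urls
    |> Task.async_stream(fetch_fun,
      max_concurrency: max_concurrency,
      timeout: timeout,
      # Kill a slow task instead of exiting the caller; it shows up below
      # as {:exit, :timeout}.
      on_timeout: :kill_task,
      # Results are emitted as they finish, not in input order.
      ordered: false
    )
    |> Enum.reduce(%{ok: [], failed: 0}, fn
      {:ok, {:ok, body}}, acc -> %{acc | ok: [body | acc.ok]}
      {:ok, {:error, _reason}}, acc -> %{acc | failed: acc.failed + 1}
      {:exit, _reason}, acc -> %{acc | failed: acc.failed + 1}
    end)
  end
end

# Usage with a fake fetcher, so the sketch runs without any network access.
fake_fetch = fn url -> {:ok, "contents of " <> url} end
result = FetchSketch.run(["a", "b", "c"], fake_fetch, max_concurrency: 2)
IO.inspect(length(result.ok), label: "fetched")
```

The stream never runs more than `max_concurrency` tasks at once, which is what keeps the app from overwhelming itself as the list of files grows. If a crashing task should not take the caller down with it, `Task.Supervisor.async_stream_nolink` is worth a look as well.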

Post #4

lpil

Creator of Gleam

That doesn’t sound like much data in an hour! Perhaps try the simplest and easiest implementation and see how that works. Best not to over-engineer when it might not be required.

Post #8

dimitarvp

Erlang’s crypto package is a very good start.

Summoning @voltone for additional options, if he doesn’t mind.

Post #18
