HTTP client libraries and wrappers

chulkilee · August 18, 2018, 5:57pm

Here are the list of HTTP client libraries/wrappers, and some thoughts on HTTP client in general. I’d like to hear from others how they work with HTTP…

HTTP client libraries

http://erlang.org/doc/man/httpc.html - erlang (part of OTP)
https://github.com/benoitc/hackney - erlang, socket pools
https://github.com/cmullaparthi/ibrowse - erlang, pool/pipeline per destination
https://github.com/puzza007/katipo - erlang, libcurl
https://github.com/ninenines/gun - erlang, HTTP/2, Websocket, keeping connection with supervisor
https://github.com/inaka/shotgun - erlang, on the top of gun, out-of-box support of Server-sent Events

HTTP Client Wrappers

https://github.com/teamon/tesla - elixir, support httpc, hackney, ibrowse
https://github.com/edgurgel/httpoison - elixir, hackney
https://github.com/myfreeweb/httpotion - elixir, ibrowse
https://github.com/alexandrubagu/simplehttp - elixir, httpc

Thoughts & Questions

Although HTTP spec says headers are case-insensitive, http client libraries should not automatically downcase such values (especially for outgoing request) since there are applications require specific cases
headers, form data, and query string should be a list not a map to preserve orders (both order of keys and order of values)
For performance, what about using NIF to parse headers? For example, puma (app server in Ruby) uses C for this: https://github.com/puma/puma/tree/v3.12.0/ext/puma_http11 - such parser may be shared across client/server.
How should HTTP libraries handle HTTP version upgrade?
Should HTTP libraries make pure functional (zero side effect) or leverage more global states (connection pools, keeping connection, etc.)?
- For example, Tesla allows creating new client on the fly, so that it “builds” client without any configuration from “global” config; however as it may use HTTP client library (application) which maintains some state in it.

gregvaughn · August 18, 2018, 9:03pm

In the meta-sense of the question, yeah, I’m not loving the fragmentation in http client libraries, however, it’s not a huge detriment to my day to day experience. I’d need to see some specific use cases to get behind some of your arguments about case and ordering, but it’s not a big deal because none of those things you’re advocating would work against any use case I’ve dealt with.

In the end, are you expecting someone else to write this ultra http client library, or are you trying to determine whether to write it yourself?

chulkilee · August 18, 2018, 9:34pm

So far I’m happy with Tesla (with httpc and hackeny) for small traffic. However for future it would be nice if I can use one library for http2 and further so this is my part of research.

For ordering - some services require signature of headers so it is required to keep the order.

voughtdq · August 18, 2018, 10:22pm

I was wondering if there was a library that leveraged libcurl, and there is! It’s actively maintained too. I’m going to give it a try later to see if it works.

dimitarvp · August 19, 2018, 1:14am

Why not do some code generation and make use of Elixir’s pattern matching, like so?

  def header_pair("Content-Encoding" <> rest), do: {"Content-Encoding", header_value(rest)}
  def header_pair("If-None-Match" <> rest),    do: {"If-None-Match",    header_value(rest)}
  # ^ these can be generated with a macro

  def header_value(val) do
    String.split(val, ~r/\s*:\s*/, trim: true)
    |> hd
  end

I am sure C code will be faster, however crossing the VM ↔ native barrier has an overhead as well. I have not measured the native approach but I would turn the question back to you: have you established with certainty that Elixir HTTP header parsing is a bottleneck in your workflow?

(EDIT: on a second thought, this is not practical if we want to accept all possible case combinations for the headers.)

BTW, awesome job compiling the list!

voughtdq · August 19, 2018, 12:20pm

Good news and bad news for Katipo.

Bad news: It depends on a metrics library that is, as far as I can tell, out of date. It uses merl to dynamically build a metrics module and it’s not getting through erl_lint on OTP 21.

Good news: The metrics module can easily be faked so that there are no problems making requests.

Here’s the fake module:

defmodule :metrics_mod do
  def new(_name, _type, _config), do: :ok
  def update(_name, _probe, _config), do: :ok
  def update_or_create(_, _, _), do: :ok
  def update_or_create(_name, _probe, _type, _config), do: :ok
  def delete(_name, _config), do: :ok
end

chulkilee · August 20, 2018, 4:19am

I haven’t work with NIF, and I’m not arguing it’s a bottleneck.

However, as it may be called so many times, so even small improvement may give significant improvement (like JSON) in some cases. I guess it depends at which boundary NIF is used - for example, using NIF to parse each HTTP header may not worth it. However, we may overage NIF to take a stream (or string) of a HTTP response, and let it returns the whole parsed results - maybe less memory copying?

chulkilee · November 7, 2018, 11:11pm

From ElixirConf 2018 keynote - see https://youtu.be/suOzNeMJXl0?t=2207

tangui · November 8, 2018, 4:38pm

Question here: in your opinion should client HTTP libs optionally deal with response caching, i.e. dealing with the cache-control, pragma, expires, etc. headers, and cache the response automatically? Or should it better be done by the application?
There’s for example that HTTPoison issue from 2 years ago. Not sure if it’d be a sane approach.

chulkilee · November 12, 2018, 6:06am

All libs should have “minimal” default behavior. I don’t think HTTP cache or auto-retry for idempotent requests should be turned on by default.

A good library is extensible (like Elixir), not full-featured. That’s why Tesla is my current choice of HTTP wrapper.

Note that httpc does not verify certificate by default

voltone · November 12, 2018, 7:31am

Neither do ibrowse, HTTPotion, gun, Tesla and SimpleHttp. And neither do hackney and HTTPoison if you pass any custom SSL options (e.g. select a custom CA, or suppress log messages with log_alert: false) without also passing verify: :verify_peer along with the right verify_fun…

chulkilee · February 22, 2019, 3:30am

xhttp is renamed to mint

Mint is different from most Erlang and Elixir HTTP clients because it provides a process-less architecture. Instead, Mint is based on a functional and immutable data structure that represents an HTTP connection. This data structure wraps a TCP or SSL socket. This allows for more fine-tailored architectures where the developer is responsible for wrapping the connection struct, such as having one process handle multiple connections or having different kinds of processes handle connections.

kats · September 6, 2019, 7:10pm

Some time ago I started implement small client library use httpc. This library definitely don’t cover all needs, but I working on it. Recently was added the ability to pass the headers and body of the request as an argument. In any case, this library can be helpful shot - is a small HTTP client library for Erlang.

dimitarvp · September 6, 2019, 9:56pm

I don’t mean to degrade, just curious if you compared your library to the Gun Erlang HTTP client?

kats · September 6, 2019, 10:11pm

Sure, I know the gun library, but gun use tcp connection, sometime we don’t need to keep connection with some services and just need to call some REST API - what was implemented inside of the shot library

dimitarvp · September 6, 2019, 10:22pm

Not sure I understand. You mean you don’t want the HTTP Keep-Alive option used when calling a REST API?

kats · September 6, 2019, 10:35pm

No, I mean, the gun use tcp connection and some time we don’t need worries about it and we need just post/get something without knowing of socket or pid of connection. The shot is wrapper of httpc.
Eg: using of gun

{ok, ConnPid} = gun:open("example.org", 443),
StreamRef = gun:get(ConnPid, "/organizations/ninenines").

Eg: using of shot:

shot:get("http://example.org/somedata").

dimitarvp · September 6, 2019, 10:52pm

Oh, ergonomy and convenience. Thanks for clarifying.

chulkilee · September 9, 2019, 6:04pm

mojito is a HTTP client based on mint - should have been mentioned!
tesla 1.3.0 is released, with mint and gun support.
castore is introduced for mint - See this issue for why new pkg not using certifi.

kodepett · December 5, 2019, 7:43pm

I ran into a strange issue today when connecting to an upstream server with Mojito - there’s a custom header which is being converted to lower case, however the upstream server expects the exact casing. Though I think this is a bad design, is there an option to force Mojito or Mint to maintain your case without converting them.

HTTP client libraries and wrappers