English3000
Immutability, dereferencing, and complex data structures
What’s on my mind is when a small part of a complex data structure is modified, one must traverse from the root to that part to update the whole.
For the children of the part being modified, one can effectively dereference. Why can’t we dereference the direct parent of that part, saving us the trouble of traversing the rest of the data structure?
The implementation would be adding a reference to its parent on the part. Then, its immediate parent could be accessed (without the need to traverse from the root to the part to be modified), and that parent could mutably dereference to the modified child.
Because a reference was changed and not any independent data, the benefits of immutability would be preserved–as one cannot concurrently modify a single parent. AND, one can concurrently modify its children, and then dereference to the parent. In this sense, adding mutable dereferencing actually makes concurrency more powerful in an immutable language… we should add it to Elixir!
How could this be implemented? Have I missed any pitfalls?
And just to recap, I am highlighting the distinction between mutating data and mutating references in relation to concurrency. The latter is okay because the datum depends on the other, meaning it can’t be modified concurrently anyways!
Most Liked
rvirding
I think the thing to be very aware of is that all data, ALL data, in Erlang/Elixir is immutable. This is not just a language feature but implemented/enforced all the way down in the BEAM. This means that when mutating a structure and rebuilding “on the way up” you never have to make copies of the unchanged parts of the structure only the actual path down to the new parts. No one else can change them so you are always safe. So yes when appending a new element to the end of a list you need to rebuild the list cells but there is no need to copy the actual elements in the list.
This means that there is actually no copy instruction/call in the machine. Well, there is one for binaries but that exists for one special purpose.
Writing NIFs for accessing parts of deep structures would be fine but I am not certain that they would be very much faster than writing code instead.
NobbZ
I’m not sure if I understand you correctly, but given a list [1, 2, 3], you want to alter the second element that the list looks like [1, -2, 3] and change the pointer from the first element to the second as well?
Such that we get this:
old = 1 ---> 2 ---> 3 ---> []
garbage = 2
new = 1 ---> -2 ---> 3 ---> []
This would brake my assumption, that I were able to use old as it was before. Also it would brake assumptions of the garbage collector about the structure of the heap (old items never reference newer ones).
If though I got you wron, can you please elaborate?
dom
I’m not sure I get what you’re suggesting, but newer data can’t point to older data in Erlang, it would break the GC. So you can’t have a child node keep a reference to a parent.
https://www.erlang-solutions.com/blog/erlang-garbage-collector.html







