This is a FAQ where I preemptively try to answer questions I think people will
have and document some design decisions.
No type system? Are you serious?
Yes, and no. Or, no and yes. There isn’t a type system for various reasons:
C++ is verbose. Even this small interpreter took a lot of work. A type system
would have increased that burden, it isn’t what I am interested in, and I
am not sure the additional work would pay off since I only wish to support
trivial scripting tasks.
Type systems are a nice to have. Surprisingly, the workhorse of functional
programming after sixty years still is Lisp. Its usage outnumbers other
functional languages by several orders. I am not sure what to conclude from that
but at least it shows that in practice people don’t care much.
Types get in the way. Some mathematics is more easily described, matrix operations,
in an untyped language. I want to script data-center calculations. It seemed
a reasonable choice.
I wouldn’t know what type system to implement. As a language needs to
support more features its type system also becomes more baroque to meet
new requirements. Academics constantly publish novel type system
fixing yet another triviality, it’s hard to get right and easy to get wrong.
After I got used to the idea I actually started wondering what would open
up with an untyped language. Lisp research has lead to some baffling new
constructs, surprising usage of a simple operational semantics. I got interested
what would, or could, happen if I relaxed all constraints I met.
Eager rewriting? Surely modern math would suggest a lazy system?
No. Not for serious tasks in my opinion. In lazy systems, every once so often
someone codes a small data processing application which when run on a large data
set suddenly starts to consume all memory and becomes unresponsive.
I consider this a show stopper.
Algorithms must have predictable behavior once implemented and deployed code
shouldn’t be able to suddenly explode on data.
Programmers have no idea how lazy evaluation behaves. They simply cannot
keep track of it. Predictable behavior is better.
I do not intend to be very pure with this language. Which, see above,
eager rewriting serves better.
What is this combinator rewriting you talk about?
Combinator rewriting is an old approach to implementing (lazy) functional
languages. It fell out of grace as it was recognized as being a slow
manner of evaluation. But I still like the idea and revived it with
a twist in order to implement mobile code. The basic idea is to have
migrating combinator definitions. My translation scheme may explode
into too many of them, but hey, it’s experimental!
Do you support tail calls?
I support an experimental trivial eager rewrite system which should
make tail calls unnecessary. It’s expensive and you better hope I got
it right though.
I am implementing cool product X. Should I include Egel for scripting?
No. Egel is experimental, probably full of bugs, and you will likely regret
not having assignment. I recommend Lua, a small Lisp, or Python for your
Combinators are represented as C++ objects? Isn’t this too expensive?
Combinators are runtime constants which are instantiated only once. The
runtime passes pointers to them along.
Why do you reference count?
Egel implements a term rewriting language which implies it should
work on acyclic graphs. I wanted to see whether I could get away with
using native C++ smart pointers only. There are drawbacks to that
approach, I agree.
Reference counting is predictable. I don’t like that in most garbage
collecting schemes, for example, file handles are closed only after
the collector makes a full sweep. In Egel, file handles are closed
immediately after an object goes out of scope. That’s the way it should
Reference counting is local and memories grow larger.
Which means it might start to pay off
on larger objects and on concurrent evaluation.
Again, you better hope I got every invariant right, though.
Your bytecode doesn’t support standard arithmetic?
Wrong, I agree. It stems from trying to work in vanilla C++ where due to technical
reasons it’s more easy to have a register language on pointers only.
You do pay a hefty price for this, but my aim is to mostly support something like
matrix operations on the long run. So, I don’t care much. I hope Egel
gives you acceptable speed.
There’s no concurrency in the VM? Where’s the stack?
Not yet. An Egel object is a term referenced by a pointer. I hope to implement
async/await on these terms where I will try to exploit that a thread isn’t
larger than a pointer.