- Ben Makes Stuff
- Posts
- How to self-host your app in 2024, the right way
How to self-host your app in 2024, the right way
And where I've been for the past several months
Hey there, been a minute đ
Itâs been a while since my last blog post! If youâre an old subscriber you might remember that during my first blog post I said Iâd build 12 startups in 12 months⌠and then I just kind of disappeared around month 7.
I bet youâre sitting here expecting some kind of elaborate explanation in terms of why I was gone, but the reality is that around month 7 I had a couple of my ideas start to make some money so I decided to focus on them.
The â12 startups in 12 monthsâ thing isnât meant to be taken literally: you build new ideas until something starts to show some promise, then you focus. Which is what I did.
I also got a bit burned out of blogging. Iâve decided Iâm going to keep blogging, but only when I feel like thereâs something truly interesting to share related to my journey of building startups.
This month, I do have something interesting to share: a âbuyers guideâ and how-to on self-hosting!
And itâs going to be a long one because I put a lot of research into this, so if you find some value out of what Iâve written here:
Hereâs a link to âď¸ buy me a coffee and show your support!
Self-hosting? Whatâs that all about now?
This monthâs post is all about self-hosting for people building software startups, otherwise known as the âcloud exit,â cancelling your Vercel account and buying a low cost VPS instead, or whatever else youâd like to call it.
The general idea behind this is that if youâre a self-hoster, you deploy and maintain your own apps without the âhelpâ of certain products that attempt to do that for you.
Now letâs go through some objections I hear a lot:
âWhy would you want to do that? Vercel/AWS/etc are all fine and helped me scale quickly.â
Because products that supposedly âhelpâ you often hurt you in the long run: once you start to achieve any degree of scale, you can end up with surprise bills from the likes of Vercel, AWS, or any other company that has usage-based pricing.
Just to give you an example, my AWS bill is expected to be over $80 this month just to host two simple apps:
My real AWS bill for this month. Iâm not using anything crazy, just standard EC2, Lambda, and Postgres on RDS. Turns out Bezosâ yacht doesnât run itself.
Thatâs insane considering neither of my apps get any significant amount of traffic at this point and Iâm using some of the cheapest virtual machines available on the platform.
Compare that to Hetzner who offers a powerful virtual private server (VPS) with 2 Cores and 4 GB RAM for a grand total of 3.85 EUR/month đ¤
When the alternative is that cheap, AWS is a ripoff unless you have unlimited VC money or credits to burn.
âOk, makes sense - but self-hosting is too hard and I donât have time!â
You might think that, but itâs total nonsense. Thereâs this myth that self-hosting requires an entire operations team to help you, itâs too much work, or any other number of excuses.
The reality is that with a cheap VPS provider and the right technology choices, you absolutely can and should deploy your own apps with very minimal ongoing maintenance, and Iâll show you how to do it the right way in this post.
âBut I have cloud credits to spend anyway.â
Cool, then spend them! Iâm not trying to tell you to spend unnecessary money, so milk AWS and Vercel for all theyâre worth. Then come back to this post when youâre ready to spend your own money.
Now that Iâve handled all of your objections, read on to understand how to do this - as I said, the right way.
As you might realize already, this isnât going to be one of those blog posts where the conclusion says that âtHe SoLuTiOn DePeNdS oN yOuR uSe CaSeâ - nope, I have strong opinions about this and I think thereâs a good way to go about it for most people and well, here it is.
Some assumptions
Letâs start with some assumptions about the type of people who will definitely find my method of self-hosting useful.
Youâll probably find value out of this if youâre a solo founder or a small team and:
You have existing app(s) deployed somewhere like AWS or Vercel and hate your monthly bills.
Or youâre just starting out and donât want to get shafted by usage-based pricing at any point.
Or you just like doing things yourself.
Iâm a little of box 1 and box 3 myself. No shame if youâre just box 3!
You should also have an app (or be planning on building one) that has some sort of web API that is deployed via Docker.
You should also know how to run bash commands and create files. If you donât know, ask ChatGPT how to do it.
Finally, if youâre doing something crazy with GPUs or youâre building the next Google, you will have further considerations that I wonât cover here - and if you think youâre the next Google, you need your head checked anyway.
But do we have to 100% self-host everything?
Ok, this is all about DIY and self-hosting, but I have to admit something upfront: while you could do absolutely everything yourself - and I used to do that myself with PHP + Apache in 2009 - these days Iâm convinced that self-hosting particularly as a solo developer is made much easier and maintainable over the long run by solutions that are halfway in between 100% DIY and a full-blown Kubernetes Cluster.
The latter does require an operations team to manage especially at a high degree of scale, and thatâs not what you want.
The former wonât be as crazy, but you will still have to spend time doing research on which reverse proxy you want to use (nginx vs. caddy vs. traefik vs. any number of other solutions), how to set up logging, how to integrate with CI, limiting the disk space that your logs can use, how to make sure apps automatically restart when the host/process dies, picking a linux distro, and generally manually configuring things when you could be building apps.
With that said, I think most people should use an open-source, minimal, self-hosted âplatform as a serviceâ - otherwise known as a PaaS - to help alleviate some of the above config and maintenance burden. This is the middle ground solution that Iâm referring to above.
Whatâs a self-hosted PaaS? Think Heroku, AWS Elastic Beanstalk, or Google App Engine, but you host it yourself. If that sounds complex, itâs really not so bad. Iâll show you how to set it up below.
For your database if you have one, keep it simple and use SQLite unless you have a reason to use Postgres like I do. For the purposes of this blog post though, Iâm assuming SQLite and no external dependencies for data storage.
But first, some requirements.
Now, letâs think through requirements for open-source PaaS software that can help us with our self-hosting goals.
These are the requirements I had in mind as I evaluated vendors as someone who has been deploying software professionally for over 10 years, and I think you should borrow these for your own app too:
0ď¸âŁ Zero downtime deployments
This means when you deploy your API for example, outstanding HTTP requests can finish executing without being abruptly cut off. Failing requests = sad customers đ˘ - think of the customers and keep uptime to a maximum!
đ¤ Some way of automatically managing app processes running on a single host
Critically, when you deploy a new version of the app, traffic should be automatically and cleanly routed to the new app. You should also be able to increase or decrease the number of processes of each app.
Single-host deployments need to be supported, meaning you can deploy multiple apps to one beefy VPS without any hacks or compromises.
đď¸ Very low resource consumption (absolutely zero at idle) and very few or zero dependencies by default.
Some dependencies might be okay, like Docker and SSH. Those are things most people already have installed so theyâre not a big deal.
But this is not rocket science! We donât need to add a Postgres server, Redis instance, a websocket server, Kubernetes, or anything fancy just to deploy an app to a host.
Besides, adding dependencies opens you up for security vulnerabilities and maintenance headaches (someone needs to keep your dependencies up-to-date, and now that responsibility is on you)
â
Well-tested, clearly documented, proven software that has been around for several years or more
Most times when I break this rule and try something ânew and shinyâ because it sounds cool I regret it. âNew and shinyâ usually means poorly documented and full of time-wasting gotchas with very few exceptions.
As a very important bonus, stuff that has been around a while will be indexed by OpenAI and Anthropic - so you can find help by asking ChatGPT if you get stuck. Itâs a huge productivity boost and like having your very own customer support agent/expert in the room with you.
đ Sane defaults and excellent ease of use through simple commands + CI integration
The PaaS we choose should âjust workâ and not require us to do too much complex configuration to get an app deployed. It should be usable in a CI pipeline.
Commands for managing apps should be able to be done through SSH or through a standard bash command. This is key because scripting and automation becomes much more possible when you can perform maintenance using commands instead of UI button clicks. A UI is a nice to have and not required.
Overall, it should be resilient to failure (survives node restarts, app restarts, etc), get out of our way, and do what itâs supposed to do.
Self-hosted PaaS Options
Browse X or do a few Google searches, and youâll find a lot of different options that claim to help you with the above requirements:
Kamal: new software, first released in 2023 and built by the 37signals team behind Ruby on Rails and Basecamp.
Dokku: first release way back in 2014! Originally released as an alternative to Heroku and now has a lot of modern features including deployment of Docker containers.
Caprover: first release was in 2017. I see this as somewhat similar to Dokku, but with more of a focus on managing things through a UI.
Coolify: first released in 2021 and built by a solo founder named Andras who seems like a nice guy.
K3s: first released in 2019, a âlightweightâ version of Kubernetes that is meant to be run on resource-constrained environments. Originally developed by Rancher Labs and now seems to be a product of SuSE?
At this point, youâre probably thinking: âwow, thatâs a lot of different options - overwhelmingâ
Good thing I tested every single one of these for you so you donât have to!
It goes without saying that none of the authors of any of these tools paid me a dime to do this analysis and I have nothing to sell you related to any of these.
So, if you appreciate this kind of thing and want to show your support, hereâs a link to buy me a coffee. I massively appreciate it!
With that out of the way: itâs time to get to the test results! But first, some setup notesâŚ
Test machine specs
For testing, I used several identical VPS nodes with the following setup with one PaaS deployed to each:
Host: Amazon Web Services
Region:
us-east-1
EC2 Instance type:
t4g.medium
2 vCPUs (arm64)
4 GB RAM
30 GB Storage
OS: Debian 12
Networking: IPv4 + IPv6 (dual stack)
Port speed: 5 GB/s
These are pretty much the kind of specs Iâd recommend in production as well, although I would not use AWS for obvious reasons. Production specs will be covered in more detail after the test results below.
The test results
Well then, how did all of these different PaaS apps perform?
Iâll just come out and say it: I cannot recommend 4 out of the 5 in good faith.
Here are the results:
Coolify
Coolify has a nice looking marketing page and seemed like it was worth a test. I did a fresh install as recommended by the docs, then rebooted to see how it would handle a node restart.
Well, it didnât:
NOTE: as of 9/3/24, @heyandras let me know this problem has been resolved (thanks Andras for fixing it!):
https://x.com/heyandras/status/1830689101873328617
The nice error message I was greeted with after rebooting. Itâs apparently fixed now, thanks Andras!
Then I noticed more errors in the logs when trying to understand what was going on:
Did the author write this healthcheck properly? âI am alive!â is considered an error response?
Also, I really, really, really was not a fan of all of the bloat it installed on the server after a fresh install with nothing running. See here:
PHP and Redis and Websockets and random containers oh my đŹ
Thatâs over 10% of your RAM gone for no reason at all, plus a few percentage points of CPU utilization that shouldnât be there.
I say âgone for no reason at allâ because you donât need any of this stuff to deploy an app to a server, I promise.
Here are some of the docker images it uses as dependencies including a 2 year old version of Postgres that is in need of an update. See what I mean about maintenance headaches when you add dependencies?
To be clear, these extra dependencies is probably why Coolifyâs docs âdonât recommendâ installing Coolify on the same server the apps run on.
But that solution is really not ideal either - now I have to run Coolify on my local machine, pay for Coolify Cloud, or buy two VPS nodes just to get my apps deployed on one VPS.
At this point, I nixed Coolify because of the reliability issue above and the unnecessary resource usage when running everything on one node, which is my entire goal here. Additionally, I couldnât find docs about running commands to manage Coolify - only using the UI - so that was a dealbreaker too.
Kamal
I really wanted to like Kamal because itâs basically just a wrapper around Docker and every command runs over SSH. Nice and simple - in theory.
Unfortunately, the tl;dr in practice is that itâs too new, the docs are lacking, and errors you might run into are hard to diagnose. ChatGPT doesnât have much accurate info about it either.
NOTE: I regret that I seem to have lost all of the screenshots I took while testing Kamal, so you will have to take my word for it on this one.
Iâll also say that I know people who deploy using Kamal successfully - mostly with Ruby apps, but I experienced many âgotchasâ with this tool. I didnât like that you needed to run a separate kamal
utility on your local machine, that it required a kamal-specific deployment config file and that it had a weird method of managing environment variables where you have to define them in multiple places before they get uploaded to the host.
Above all, the tool simply didnât work for me. When I finally got the app deployed after a few hours of struggling, I thought I had âwon.â Then I went to deploy a new version of the app with a simple, non-breaking change and Kamal refused to switch traffic to the new container with some hard to understand error message.
The only thing that fixed this problem for me was wiping the host and doing a fresh kamal setup
at which point it started working again on the first deployâŚuntil I deployed another new version of the app again, and so on.
Since I never had a successful test on Kamal and the product did not work as advertised, Iâm not going to include much further detail here.
Kamalâs resource consumption was quite good at idle (nothing extra running), but I think it needs a lot of work on documentation and error handling to be a viable choice. It doesnât matter if itâs the most efficient tool in the world if it doesnât work.
Kamal 2 is coming out soon which sounds promising, but itâs not here yet and thereâs better options available today.
Caprover
Next, I looked at Caprover. This one was eliminated when I looked at the docs:
This is worrying. First, it doesnât fit my need of being able to do things on the command line to have to log into a UI and manually input fields to scale out a deployment.
Second, it appears that some sort of custom leader node election algorithm has been implemented - and not enough detail has been provided.
These are not easy things to implement correctly - and I donât have confidence the author did it correctly if there are no details here on how exactly it was implemented.
I also didnât like how the maintainer of the project conducted himself when presented with a problem about one of the dependencies in the project:
Weird, condescending tone in his response. Not a fan of working with people like this.
At this point, I nixed Caprover. Mostly because of the lacking documentation and inability to use the command line to perform scaling actions, but this interaction didnât help.
K3s:
K3s basically did what it promised and the docs are quite extensive. Bravo to the team maintaining K3s.
If you run the install command as described, you get a ready-to-use K8s instance running on fewer resources than youâd expect from a standard K8s deployment which you can confirm with kubectl get node
:
I forgot to take a real screenshot of this part, but hereâs what it looks like: imagine just one line instead of 3 and 127.0.0.1 instead of node1.example.com.
I hard rebooted the machine right after this and it came back up without any issues.
Unfortunately though, K3s suffers from resource usage problems while idling. Even though it bills itself as lightweight, and it is compared to K8s, itâs really not so lightweight overall:
Thatâs 17% of your RAM and 4.3% of your CPU being used forâŚnot sure what. metrics-server is also included in the calculation.
Itâs also not ideal for ease of use in a CI environment. To deploy a docker image to the cluster, you need to define a fairly wordy yml file every time you deploy and pass it to kubectl apply -f
:
apiVersion: apps/v1
kind: Deployment
metadata:
name: nginx-deployment
labels:
app: nginx
spec:
replicas: 3
selector:
matchLabels:
app: nginx
template:
metadata:
labels:
app: nginx
spec:
containers:
- name: nginx
image: nginx:1.21.1
ports:
- containerPort: 80
---
apiVersion: v1
kind: Service
metadata:
name: nginx-service
spec:
selector:
app: nginx
ports:
- protocol: TCP
port: 80
targetPort: 80
type: LoadBalancer
This can be automated by creating a template file and substituting the image name with the current version to deploy among other properties, but this requires extra time to get set up.
I also see the number of dependencies involved here as a negative, even if they happen to be packaged together:
Good luck fixing anything on your own if any of these components malfunction! Thatâs a lotta components.
Last but not least: Dokku đ (Winner!)
Finally, I got to Dokku.
As soon as I installed Dokku, it became obvious that this was the first self-hosted PaaS I had tested that really prioritized all of the things I was looking for.
First, the docs are truly great. I wonât go into it too much here, but hereâs one selection from the docs about how to deploy a docker image to the host:
Notice how much simpler it is than fighting with some giant k8s yaml file. Just one command!
Secondly, after a fresh install, look at the list of running processes:
Itâs a great sign when the highest CPU usage comes from my own top command. Look ma, no memory usage either!
This is how it should be done. No extra bullshit, no unnecessary containers. A PaaS that gets out of the way when it isnât needed!
I then took a look at the dokku
executable that gets installed on the host and it turns out itâs a fairly easy to read bash script without any weird dependencies:
Hereâs what the script looks like. It eventually just invokes docker for commands you might run on your apps.
Again, perfect. Then I noticed Dokku had a prebuilt Github Actions plugin to deploy an app.
I spent about an hour setting this up, mostly because I didnât read the docs and forgot to configure docker registry credentials, but it was quite easy to get going with a bit of Terraform magic that Iâll include for everyone below.
At the end of the day, I just needed to add one block to my deploy script:
This is all I had to add to get the deploy working after I followed the setup steps in the docs.
After minimal fuss, my app was deployed, and I was happy to see some sane defaults: nginx is very resource efficient and battle-tested as a reverse proxy, and I liked that it included output from my app so that I could see the deploy succeeded:
This kind of output is something no other solution got right. Kamal in particular had about 10x the output for any command that was run and it was completely impossible to parse and understand, whereas Dokkuâs output is clear.
Recommended specs and OS for a production VPS
Now that youâve seen my strong preference is to use Dokku to host my apps, letâs cover the specs for an ideal virtual private server (VPS) for this purpose.
In production, Iâd opt for an instance on Hetzner: go to https://www.hetzner.com/cloud/ (not a referral link), select Shared vCPU Ampere, and select the CAX11 node type.
Thatâs the best bang for your buck since youâll get 2 vCPU Cores and 4GB RAM for less than 3.79 EUR (or about ~$4.30 US). Hard to beat that kind of deal!
You could also use any of Hetznerâs AMD or Intel-based nodes which would be fine and may be closer to your customers if you have US-based customers, but I prefer arm64 since my development machine is a Macbook Pro with an arm64 processor and I like my development machine to match my production machine in terms of CPU architecture. Most of my customers happen to be in the EU anyway.
This setup with arm64 in dev and in production lowers the chances of some difference in behavior between those environments. I also like that arm64 is cheaper of course.
A note on the OS: I recommend Debian.
It also can automatically install security patches for you which is a key thing to have as a solo server admin, and itâs a reliable, minimal Linux distro that only includes the necessary software with very stable versions of said software.
As an endorsement: Iâve used various versions of Debian in production with zero issues for many years now, and a previous company I worked for (with a market cap of tens of billions of dollars) ran 100% of its servers on Debian, also with no issues.
Now for a note on IPv4 vs. IPv6: it may seem tempting to save money and go for an IPv6-only node because Hetzner will give you a further discount. Do not do this!
Why? Well, many websites donât support IPv6 yet, and you will run into connection errors if you have a dependency on a 3rd party website that doesnât. You might not even realize that you have a dependency on one of these sites and run into some very hard-to-diagnose bug as a result.
For example, github doesnât support IPv6 yet and neither does X. You can confirm which websites support this by issuing a dig command.
dig AAAA x.com +short
If the above command doesnât have any output, the website in question does not yet support IPv6 and any requests made to it from a host with only an IPv6 public IP will fail.
Deploying Dokku in Production as a solo developer
If you donât feel like reading the the docs which are quite good, hereâs my own guide on how to get Dokku deployed to a Hetzner node in production:
First tap âAdd Serverâ
Then select a location close to your customers and Debian as the OS
Finally select Ampere as the node type and pick the cheapest one. Notice the great deal: 3.79 EUR/mo đ
Finish the rest of the wizard, and log into the node with SSH ssh [email protected]
.
Then paste this bash script below. Important notes:
1. Make sure to edit it with you SSH public key from your laptop, and another one for your CI system if any.
2. Also make sure to add your appâs environment variables if any under âmake sure to set your app config hereâ
To generate a public key, just use:ssh-keygen -t ed25519 -C "github-actions"
And copy the output from the .pub
file that it put in your ~/.ssh
folder into the script below âADD YOUR KEYS BELOW!â
#!/bin/bash
# Ensure latest updates are installed
apt update -y
apt upgrade -y
# Install Dokku via official install script (make sure to find the latest version on dokku.com and use that as the DOKKU_TAG and url for wget)
echo 'Installing Dokku...'
wget -NP . https://dokku.com/install/v0.34.9/bootstrap.sh
sudo DOKKU_TAG=v0.34.9 bash bootstrap.sh
# Add SSH keys to Dokku so our dev machine + CI tool can run Dokku commands. ADD YOUR KEYS BELOW!
echo 'Done installing Dokku. Adding SSH keys...'
# Example: echo 'ssh-ed25519 AAAA... your-laptop' | dokku ssh-keys:add your-laptop
# Example: echo 'ssh-ed25519 AAAA... github-actions' | dokku ssh-keys:add github-actions
# Set up Dokku apps and credentials
echo 'Done adding SSH keys. Creating apps...'
dokku apps:create your-api
echo 'Done creating apps. Configuring app domains...'
dokku domains:set your-api api.yoursite.com
echo 'Done configuring app domains. Starting vector for logging...'
dokku logs:vector-start
wget -O /var/lib/dokku/data/logs/vector.json ${VECTOR_SINK_CONFIG_URL}
echo 'Done configuring vector. Authenticating with Docker registry...'
dokku registry:login ${DOCKER_REGISTRY_HOST} ${DOCKER_REGISTRY_USERNAME} ${DOCKER_REGISTRY_ACCESS_TOKEN}
# Make sure to set your app config here if any
# dokku config:set your-api SOME_APP_SECRET1=value1 SOME_APP_SECRET2=value2
# Reboot and finalize installation
echo 'Done with everything, waiting 30s before rebooting to finalize installation.'
sleep 30
systemctl reboot
For logging, you can either comment out the dokku logs
line to disable exporting logs, or sign up for a free account with Better Stack to store 3GB/month of logs for free.
If you do choose to use Better Stack, they support Dokku natively so just create a new log source and they will give you a command to set up the log source with Dokku
The create source UI with Dokku selected
You will need to run the script like this:
DOCKER_REGISTRY_HOST=ghcr.io DOCKER_REGISTRY_USERNAME=username DOCKER_REGISTRY_ACCESS_TOKEN=ghp_... VECTOR_SINK_CONFIG_URL="https://urlfrombetterstack.com" ./init.sh
When run, this script will take about 10-15 minutes to finish and:
Install Dokku which will use nginx as a reverse proxy for all your apps by default. You shouldnât need to configure this at all.
Pre-create all of your apps. Dokku will auto create any apps that donât exist when you deploy for the first time, but itâs better to pre-create them so you can set the domain for each app upfront and have your app work immediately.
Set up Dokku to be able to pull images from your Docker registry
Set up logging with export to Better Stack (nice UI for viewing your app logs in a web interface and to create alerts)
Reboot the VPS (so donât be alarmed if you get disconnected)
Wait the full 10-15 mins for the process to complete and the node to come back online. To test that the process worked, run this command on your local machine:ssh dokku@yourserverip version
This will use the SSH key you generated above. And you should get an output showing you the version of Dokku running on the server.
Now update your CI script (hopefully youâre using GitHub Actions, it makes this part easier) to include one additional command which includes the SSH private key you generated above as a GitHub Repository Secret:
- name: Deploy API
uses: dokku/[email protected]
with:
ssh_private_key: ${{ secrets.SSH_PRIVATE_KEY }}
git_remote_url: ssh://dokku@yourserverip:22/${{ github.repository }}-api
deploy_docker_image: ghcr.io/${{ github.repository }}-api:${{ github.sha }}
Run the deploy and you should see a message showing you that your app was deployed to Dokku at the URL you set. Youâre done with deployment!
Security and Hardening
Now that your app has been deployed, itâs time to think about how to harden it so that attackers will have a tougher timeâŚwell, attacking you.
The first recommendation I always give to people is to set up a Cloudflare account.
Putting Cloudflare in front of your domains prevents a whole class of DDoS attacks and Cloudflare is able to cache certain responses for you to save you bandwidth.
After you set Cloudflare up, go to the websites tab and add your appâs website, then edit the DNS settings:
In the above screenshot, youâll want to create an A
record with the IP address pointing to your VPSâ IP address with Dokku on it and make sure âProxy statusâ is proxied (orange cloud icon).
Then, back in your Hetzner control panel, create a firewall rule that only allows Cloudflare IPv4 and IPv6 addresses (copy and paste both sets of IPs from here: https://www.cloudflare.com/ips/) to access your VPS on port 80 (the web port that Dokkuâs nginx instance is serving traffic on):
Firewall setup example
Then scroll down to Apply to and make sure you select the server you just created
Then hit Create Firewall.
The effect of this is that only cloudflare IP addresses will be able to communicate with your VPS on port 80. This means attackers will be completely blocked from hitting your VPSâ IP directly, which would otherwise bypass Cloudflare.
You might also consider adding a rule that only allows GitHub actions CI IP addresses and your home IP to access port 22 for SSH.
Finally, to lock down your SSH server even further, make sure to disable password auth. This should already be configured as most Debian images provided by hosting providers do this already, but good to double check:
This will only allow users with valid SSH keys to connect and avoids the problem where attackers attempt to brute force passwords.
Conclusion
Thatâs all we have folks!
I put a lot of research into this, so if you find some value out of what Iâve written here, hereâs a link to âď¸ buy me a coffee and show your support!
Thanks so much for reading either way.
Reply