CI/CD for cloud-native GenAI reference architecture

Walkthrough: https://youtu.be/noilYeBqNUA?si=7awZFw5g_upbkPKc

This repo will show you how to create a complete CI/CD architecture for GenAI apps running on Kubernetes, supporting a fully private deployment.

This example will run entirely locally on a kind cluster on your laptop. In development, we will use external LLM provider together.ai. Helix will provide the versioned AI app implementation (prompt management, knowledge/RAG and API integrations) and evals (testing). GitHub actions will run the evals in CI. Flux will manage deployment.

There are two main flows: the CI (testing) flow where you can run helix test locally or in CI. And the CD flow where changes to AI apps merged to the main branch get reconciled into the cluster using Flux and deployed to Helix by the helix k8s operator.

Setup

1. Fork this repo

Start by forking this repo. This is because part of the workflow is pushing changes to the repo and having GitHub Actions and Flux react to changes, so you'll need write access to the repo.

Then check out the repo on your local machine:

export GITHUB_USERNAME=<yourusername>

git clone [email protected]:${GITHUB_USERNAME}/genai-cicd-ref
cd genai-cicd-ref

2. Install helix in kind

Requirements:

docker
kind or brew install kind
kubectl or brew install kubectl
helm or brew install helm
flux cli or brew install fluxcd/tap/flux
helix cli or curl -sL -O https://get.helix.ml/install.sh && bash install.sh --cli
ngrok or brew install ngrok/ngrok/ngrok

We will run the kind_helm_install.sh script which will create a kind cluster and install helix in it via helm.

For this deployment, to simplify things, we'll use Together.ai as an external LLM provider (which provides free credit for new accounts), but you can later attach a Helix GPU runner in Kubernetes or otherwise.

export TOGETHER_API_KEY=<your-together-key>
bash kind_helm_install.sh

If at any point you need to start over, you can just re-run the script (it will tear down the kind cluster and recreate it from scratch).

watch kubectl get po

This will show helix starting up and running in your local kind cluster. Once all the pods are running, ctrl+c the watch and run the four commands the script printed at the end of the install to start a port-forward session:

export POD_NAME=$(kubectl get pods --namespace default -l "app.kubernetes.io/name=helix-controlplane,app.kubernetes.io/instance=my-helix-controlplane" -o jsonpath="{.items[0].metadata.name}")
export CONTAINER_PORT=$(kubectl get pod --namespace default $POD_NAME -o jsonpath="{.spec.containers[0].ports[0].containerPort}")
echo "Visit http://localhost:8080 to use your application"
kubectl --namespace default port-forward $POD_NAME 8080:$CONTAINER_PORT

Leave that running.

Load http://localhost:8080 and you should see Helix. It takes a few minutes to boot.

Register for a new account (in your local helix install, through the web interface) and log in.

Detailed steps for registering and logging in

In a web browser go to: http://localhost:8080/
Register local user
1. Bottom left pane - click on "Login/Register"
2. Click on "Register" to begin the user registeration process
3. Complete user registration
To access the app, log in to the local HelixML UI with your registered user credentials. Test out creating a chat session

Install the aispec CRDs and start the Helix Kubernetes Operator. For now we do this by cloning the helix repo, but these will be properly packaged and released as container images soon. In a new terminal session (you will need go installed - e.g brew install go):

git clone https://github.com/helixml/helix
cd helix/operator
make install

Go to your helix account page (click the ... button in the bottom left and go to Account & API section) then copy and paste the export commands for HELIX_URL and HELIX_API_KEY from the "Set authentication credentials" section. Run them, then run the Helix Kubernetes Operator:

make run

Leave the operator running in this terminal window. You should have two terminal windows now: one with the port-forward running in it and another with the helix operator running in it.

Test that the operator is working by deploying an aispec just with kubectl in a new terminal window:

kubectl apply -f aispecs/exchangerates.yaml

It should look like this:

Inside helix, the app should now be working. Go to the app store on the homepage, then launch the exchange rates app:

You can use it to query live currency exchange rates.

Clean up the app:

kubectl delete -f aispecs/exchangerates.yaml

3. Install Flux

We will use Flux to automate GitOps deployments of changes to this app, rather than manually using kubectl.

Install flux in the kind cluster:

flux install

Add your fork of this repo to flux:

export GITHUB_USERNAME=<yourusername>

flux create source git aispecs \
    --url=https://github.com/${GITHUB_USERNAME}/genai-cicd-ref \
    --branch=main

Set up flux to reconcile aispecs in your fork:

flux create kustomization aispecs \
    --source=GitRepository/aispecs \
    --path="./aispecs" --prune=true \
    --interval=1m --target-namespace=default

4. Set up GitHub Actions

So that the GitHub Actions in this repository can run against your local kind cluster, we'll run ngrok and configure GitHub Actions with the appropriate HELIX_URL variable and HELIX_API_KEY secret.

Start ngrok forwarding to your local Helix server

ngrok http 8080

In a new terminal, get the public URL:

curl -s localhost:4040/api/tunnels | jq -r '.tunnels[0].public_url'

The output will look something like: https://abc123.ngrok.io

Add the URL as a variable and API key as a secret to your GitHub repository:

Go to your GitHub repository settings for this repo
Click on "Settings" tab
In the left sidebar, click "Secrets and variables" -> "Actions"
Click the "Variables" tab
Click "New repository variable"
- Name: HELIX_URL
- Value: The ngrok URL from above (e.g. https://abc123.ngrok.io)
Click the "Secrets" tab
Click "New repository secret"
- Name: HELIX_API_KEY
- Value: The API key from your Helix account page

These credentials will be used by the GitHub Actions workflows to authenticate with your local Helix instance.

Go to your Actions tab, find a failing run and re-run it to check that it works and tests the Helix app on the main branch.

Continuous Integration: Testing

Go to your helix account page (click the ... button in the bottom left and go to Account & API section, then copy and paste the export commands for HELIX_URL and HELIX_API_KEY).

git checkout -b new-feature

Edit the aispec aispec/exchangerates.yaml to add a feature or test.

Run tests locally:

helix test -f aispecs/exchangerates.yaml --evaluation-model meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo

Push to CI:

git commit -am "update"
git push

You will see the tests run in github actions when you make a pull request.

Continuous Delivery: Deployment via GitOps

If the tests are green, you can merge to main. On push to main, Flux will pick up the new manifest and deploy it to your cluster.

You can run:

flux get kustomizations --watch

Flux can take up to a minute to notice the change in the repo.

Open the app in your browser by navigating to the "App Store" in your local helix install web UI, and observe the new improved GenAI capabilities!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CI/CD for cloud-native GenAI reference architecture

Setup

1. Fork this repo

2. Install helix in kind

3. Install Flux

4. Set up GitHub Actions

Continuous Integration: Testing

Continuous Delivery: Deployment via GitOps

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 61 Commits
.github/workflows		.github/workflows
aispecs		aispecs
images		images
README.md		README.md
kind_helm_install.sh		kind_helm_install.sh

helixml/genai-cicd-ref

Folders and files

Latest commit

History

Repository files navigation

CI/CD for cloud-native GenAI reference architecture

Setup

1. Fork this repo

2. Install helix in kind

3. Install Flux

4. Set up GitHub Actions

Continuous Integration: Testing

Continuous Delivery: Deployment via GitOps

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages