1159 Commits (d2acb067a540603c773047a9351ded55eda478f6)

Author SHA1 Message Date
Aditya Maru 8a782a7c2b
Merge pull request #80 from useblacksmith/gc-inode-tweaks
src: disable automatic buildkit GC
2024-12-23 09:21:40 -05:00
Aditya Maru 9fdeb57c53 src: disable automatic buildkit GC
We have reason to believe that automatic GC is affecting
daemon startup times. In this patch we disable automatic GC
and instead rely on manual pruning of the buildkit cache.
Once the daemon is ready we spawn an async task to run prune
on any objects older than 14 days. We already handle the case of the
Ceph volume approaching its size limit ourselves in the VM
Agent.

Patch also adds some alerting when inode usage is high on a mountpoint.
2024-12-23 09:15:34 -05:00
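The pruning and inode alerting described in this commit can be sketched roughly as below. This is a minimal illustration, not the code from the patch: the helper names `pruneArgs` and `inodeUsageIsHigh` and the 90% alert threshold are assumptions, and the arguments assume that `buildctl prune` is the pruning command and that its `--keep-duration` flag takes a Go-style duration. Only the 14-day cutoff comes from the commit message.

```typescript
// Hypothetical helpers; names and the 0.9 threshold are illustrative assumptions.

// The 14-day cutoff is taken from the commit message.
const PRUNE_AGE_DAYS = 14;

// Build the argument list for a manual prune, assuming `buildctl prune`
// with a Go-style duration for --keep-duration (for example "336h").
function pruneArgs(maxAgeDays: number): string[] {
  const hours = maxAgeDays * 24;
  return ["prune", "--keep-duration", `${hours}h`];
}

// Decide whether inode usage on a mountpoint is high enough to alert on.
// totalInodes and freeInodes would come from a statfs-style call.
function inodeUsageIsHigh(
  totalInodes: number,
  freeInodes: number,
  threshold = 0.9, // assumed threshold, not stated in the commit
): boolean {
  if (totalInodes <= 0) {
    return false; // filesystem does not report inode counts
  }
  const used = (totalInodes - freeInodes) / totalInodes;
  return used >= threshold;
}

console.log(pruneArgs(PRUNE_AGE_DAYS).join(" "));
```

In a real action the prune would be started without awaiting it once the daemon reports ready, so that startup is not blocked by the cleanup.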
Aditya Maru d0a5da09cc
Merge pull request #83 from useblacksmith/print-api-url
src: print api url in debug info
2024-12-21 23:43:51 -05:00
Aditya Maru 61713d1849 src: print api url in debug info 2024-12-21 23:42:52 -05:00
Aditya Maru 79d3930e8c
Merge pull request #82 from useblacksmith/silence-metric-warning
src: silence metric warnings for now
2024-12-21 23:13:47 -05:00
Aditya Maru 6fe2467492 src: silence metric warnings for now 2024-12-21 23:12:08 -05:00
Aditya Maru 6a143ed7fc
Merge pull request #81 from useblacksmith/use-backend-env
src: use the plumbed BLACKSMITH_BACKEND_URL if present
2024-12-21 12:26:38 -05:00
Aditya Maru 4759d93c12 src: use the plumbed BLACKSMITH_BACKEND_URL if present 2024-12-21 12:08:11 -05:00
Aditya Maru e836937c09
Merge pull request #79 from useblacksmith/wire-metrics
*: report metrics to the VM agent
2024-12-20 19:18:02 -05:00
Aditya Maru def1585067 *: report metrics to the VM agent 2024-12-20 17:43:40 -05:00
Aditya Maru 70572bc121
Merge pull request #78 from useblacksmith/debug-shutdown-spurious-warning
src: stop spurious warnings on buildkit shutdown
2024-12-19 19:19:20 -05:00
Aditya Maru 4723a2a346 src: stop spurious warnings on buildkit shutdown 2024-12-19 19:04:07 -05:00
Aditya Maru 31dd0ccbdd
Merge pull request #77 from useblacksmith/fix-shutdown
src: fix shutdown retry behavior
2024-12-19 13:10:27 -05:00
Aditya Maru 1672d6fbad src: fix shutdown retry behavior 2024-12-19 13:04:09 -05:00
Aditya Maru 726f2f6338
Merge pull request #76 from useblacksmith/run-diff
src: stop running process as nohup to avoid missing logs
2024-12-19 12:56:48 -05:00
Aditya Maru 9302d2aea9 src: stop running process as nohup to avoid missing logs 2024-12-19 12:44:35 -05:00
Aditya Maru e9a7d2cb89
Merge pull request #74 from useblacksmith/debug-failures
src: cleanup flakiness in different parts of the action
2024-12-18 10:10:27 -05:00
Aditya Maru ac42783fa9 src: cleanup flakiness in different parts of the action 2024-12-18 09:58:15 -05:00
Aditya Maru 7227817bb9
Merge pull request #73 from useblacksmith/socket-cat
src: refactor cleanup logic to expose buildkitd.log
2024-12-16 19:40:39 -05:00
Aditya Maru 54bc4e0788 src: refactor cleanup logic to expose buildkitd.log
Previously, we only killed the buildkitd process and unmounted
if builderInfo was non-null. This was wrong because we could have set up
buildkitd but failed after that step. Cleanup would then rely on the
last-ditch effort by the post action. We now change the process kill
and unmount to happen on any build error.
2024-12-16 19:25:47 -05:00
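The control-flow change described above can be sketched as follows. This is an assumed shape rather than the refactored code itself: the `BuildHooks` interface and the function `buildWithCleanup` are hypothetical names, and leaving teardown on success to the post action is also an assumption.

```typescript
// Hypothetical hooks standing in for the action's real steps.
interface BuildHooks {
  startBuildkitd(): Promise<void>;
  runBuild(): Promise<string>; // resolves to some builder info
  killAndUnmount(): Promise<void>;
}

// Run the build and tear down on ANY error, regardless of whether
// builder info was ever obtained.
async function buildWithCleanup(hooks: BuildHooks): Promise<string> {
  try {
    await hooks.startBuildkitd();
    // If this step throws, buildkitd is already running and mounted,
    // so cleanup must not depend on builder info being non-null.
    return await hooks.runBuild();
  } catch (err) {
    await hooks.killAndUnmount();
    throw err;
  }
  // On success nothing is torn down here; this sketch assumes the
  // post action handles normal teardown.
}
```

The point of the sketch is that the cleanup sits in the `catch` of the whole sequence instead of being guarded by a check on the build result.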
Aditya Maru 8d0da8c56b
Merge pull request #72 from useblacksmith/move-to-grpc
*: move to grpc backed communication for the agent
2024-12-16 15:38:39 -05:00
Aditya Maru d43ee61bb7 *: move to grpc backed communication for the agent 2024-12-16 15:29:30 -05:00
Aditya Maru c7c50538d0
Merge pull request #71 from useblacksmith/subdue-noisy-log
main: ignore error when nothing is mounted
2024-12-15 17:17:31 -05:00
Aditya Maru 53000f0f59 ignore error when nothing is mounted 2024-12-15 17:16:24 -05:00
Aditya Maru 23eae444a4
Merge pull request #70 from useblacksmith/cleanup-failure
src: ignore error when there's nothing mounted
2024-12-15 17:03:59 -05:00
Aditya Maru 1df1b3c361 src: ignore error when there's nothing mounted 2024-12-13 12:32:05 -05:00
Aditya Maru 5b9a178332
Merge pull request #69 from useblacksmith/umount-timeout
src: make post unmount even if buildkitd is no longer present
2024-12-10 21:34:24 -05:00
Aditya Maru de0451e517 src: make post unmount even if buildkitd is no longer present
Also increase retries when trying to unmount the buildkit directory.
Retry up to 3 seconds now, previously we were only retrying 3 times
with a 100ms backoff.
2024-12-10 21:26:18 -05:00
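The difference between a fixed attempt count and a time budget can be sketched as below. This is an illustration under assumptions, not the patch itself: `retryUntilDeadline` is a hypothetical helper, and keeping the 100ms backoff with the new 3 second budget is assumed, since the commit message only states the budget.

```typescript
const sleep = (ms: number): Promise<void> =>
  new Promise<void>((resolve) => setTimeout(resolve, ms));

// Retry `attempt` until it succeeds or the time budget is exhausted.
// Resolves to the number of attempts made; rethrows the last error
// once another backoff would exceed the deadline.
async function retryUntilDeadline(
  attempt: () => Promise<void>,
  deadlineMs = 3000, // 3 second budget, from the commit message
  backoffMs = 100, // assumed to be unchanged from the old behavior
): Promise<number> {
  const start = Date.now();
  let tries = 0;
  for (;;) {
    tries++;
    try {
      await attempt();
      return tries;
    } catch (err) {
      const elapsed = Date.now() - start;
      if (elapsed + backoffMs > deadlineMs) {
        throw err; // out of budget
      }
      await sleep(backoffMs);
    }
  }
}
```

With a fixed 100ms backoff, three attempts give up after roughly 200ms of waiting, while a 3 second budget allows on the order of thirty attempts, which gives a busy mount more time to be released.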
Aditya Maru c6b6f32adc
Merge pull request #68 from useblacksmith/stop-sending-form-data
src: start sending get request with query params
2024-12-09 13:07:25 -05:00
Aditya Maru 0f99a0b1c7 src: start sending get request with query params
We are incorrectly using formData in a GET request. To move
away from this, we send both query params and formData until
the server is fully upgraded, after which we can stop sending
formData.
2024-12-09 13:01:35 -05:00
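The transition described above, where the same fields travel both as query parameters and as legacy form fields, might look roughly like this. The function `buildGetRequest`, the field names, and the URL are hypothetical and only illustrate the idea; the actual request code in the repository is not shown in this log.

```typescript
interface PreparedRequest {
  method: "GET";
  url: string;
  // Present only while the server still reads form data.
  legacyFormFields?: Record<string, string>;
}

// Encode the fields as query parameters and, during the migration,
// also carry them as form fields for servers that are not upgraded yet.
function buildGetRequest(
  baseUrl: string,
  fields: Record<string, string>,
  sendLegacyFormData: boolean,
): PreparedRequest {
  const url = new URL(baseUrl);
  for (const [key, value] of Object.entries(fields)) {
    url.searchParams.set(key, value);
  }
  const request: PreparedRequest = { method: "GET", url: url.toString() };
  if (sendLegacyFormData) {
    request.legacyFormFields = { ...fields };
  }
  return request;
}

// Example with made-up field names and a placeholder host.
const prepared = buildGetRequest(
  "https://example.invalid/api/resource",
  { region: "eu", id: "42" },
  true,
);
console.log(prepared.url);
```

Once every server reads the query string, the flag can be switched off and the form fields dropped without touching the callers.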
Aditya Maru 0186286e06 *: use axios-retry instead of handrolled retry methods 2024-12-09 13:01:20 -05:00
Aayush Shah 7b8642822f
src: make `getDockerfilePath` return the full path to the dockerfile (#64)
Previously we were just returning the path to the dir containing the dockerfile
in most cases.
2024-12-09 12:20:46 -05:00
Aditya Maru a415504014
Merge pull request #66 from useblacksmith/alert-on-cleanup
src: alert if an exception is thrown on cleanup
2024-12-08 20:12:05 -05:00
Aditya Maru f06a558c36 src: alert if an exception is thrown on cleanup 2024-12-08 19:21:46 -05:00
Aditya Maru 1cc1561946
Merge pull request #65 from useblacksmith/fix-conditional
src: fix bug in conditional that zero'd out expose ID
2024-12-08 18:51:55 -05:00
Aditya Maru b76cd7bf3b src: fix bug in conditional that zero'd out expose ID 2024-12-08 18:44:36 -05:00
Aditya Maru 9f63c681ba
Merge pull request #63 from useblacksmith/refactor-for-tests
*: refactor methods to support mocking
2024-12-08 17:07:26 -05:00
Aayush f9d1e150a9
*: unify error handling and add more unit tests 2024-12-08 16:41:30 -05:00
Aditya Maru c71ad2dbef *: refactor methods to support mocking
Additionally, write some tests to ensure the driver method
`startBlacksmithBuilder` handles all exceptions correctly in
both nofallback=true and nofallback=false configurations.
2024-12-08 14:35:30 -05:00
Aditya Maru 15e5beff2d
Merge pull request #62 from useblacksmith/default-devdb
backend: default to /dev/vdb when ExposeVolume response is empty
2024-12-06 22:42:06 -05:00
Aditya Maru 5ab78173d3 backend: default to /dev/vdb when ExposeVolume response is empty 2024-12-06 22:39:20 -05:00
Aditya Maru 96821d6e49
Merge pull request #61 from useblacksmith/use-expose-resp
backend: use device returned in ExposeVolume response
2024-12-06 22:33:12 -05:00
Aditya Maru edc01b36df backend: use device returned in ExposeVolume response 2024-12-06 22:31:43 -05:00
Aditya Maru 3d09910efc
Merge pull request #60 from useblacksmith/stickykey-commit
src: send stickydisk key with commit
2024-12-03 20:41:30 -05:00
Aditya Maru 72c7e93db9 src: send stickydisk key with commit 2024-12-03 18:31:15 -05:00
Aditya Maru e00a3895c3
Merge pull request #58 from useblacksmith/bump-sock-timeout
src: bump socket creation timeout to 5s from 3s
2024-12-03 16:57:37 -05:00
Aditya Maru 6d7db93fa2 src: bump socket creation timeout to 5s from 3s 2024-12-03 16:44:56 -05:00
Aditya Maru 2d7d5ad6c2
Merge pull request #57 from useblacksmith/fix-report
src: report the correct sticky disk key
2024-12-03 15:41:41 -05:00
Aditya Maru c308f14958 src: report the correct sticky disk key 2024-12-03 15:34:30 -05:00
Aditya Maru 0fb5318cf4
Merge pull request #56 from useblacksmith/shuttle-exposeid
src: shuttle an expose ID between expose and commit
2024-12-02 17:44:23 -05:00