Prometheus looks like it covers monitoring: it scrapes and stores time-series metrics (resource usage etc.) and has a basic built-in graphing UI, with fuller dashboards usually built in Grafana. Many tools seem to have built-in support for it (e.g. Argo Workflows can expose metrics for it to scrape).
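Rough sketch of what "built-in support" for Prometheus tends to mean: the tool serves a plain-text `/metrics` endpoint and Prometheus pulls from it on a schedule. This uses only the Python standard library. The counter names, the `render_metrics` and `serve` helpers, and port 8000 are all made up for illustration. A real service would more likely use an official client library (e.g. `prometheus_client`) than hand-render the format.

```python
from http.server import BaseHTTPRequestHandler, HTTPServer

# Hypothetical in-process counters; the names are illustrative only.
COUNTERS = {"jobs_completed_total": 0, "jobs_failed_total": 0}


def render_metrics(counters):
    """Render a dict of counters in the Prometheus text exposition format."""
    lines = []
    for name, value in sorted(counters.items()):
        lines.append(f"# TYPE {name} counter")  # metadata line declaring the metric type
        lines.append(f"{name} {value}")          # sample line: metric name, then value
    return "\n".join(lines) + "\n"


class MetricsHandler(BaseHTTPRequestHandler):
    """Serves the current counters at /metrics and 404s everything else."""

    def do_GET(self):
        if self.path != "/metrics":
            self.send_error(404)
            return
        body = render_metrics(COUNTERS).encode("utf-8")
        self.send_response(200)
        # Content type used for the Prometheus text format.
        self.send_header("Content-Type", "text/plain; version=0.0.4; charset=utf-8")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


def serve(port=8000):
    """Block forever, answering scrape requests on the given port (assumed value)."""
    HTTPServer(("", port), MetricsHandler).serve_forever()
```

Calling `serve()` starts the endpoint. A Prometheus scrape job pointed at that host and port would then collect the counters each scrape interval.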
See also OpenTelemetry
Grafana Loki for logging