We have used DolphinScheduler as our scheduling framework for a long time, but our monitoring kept detecting repeated scheduling. This post records the whole process of resolving the issue.
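We have been using DolphinScheduler as our scheduling framework for a long time. However, our monitoring kept picking up duplicate scheduling, and this post records the full process of troubleshooting the duplicate scheduling problem.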
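While using our scheduling platform, which is a customized build on top of DolphinScheduler, a user found a bug: a workflow got stuck halfway through execution and could neither continue nor be stopped.
The bug turned out to be trickier than expected and took a good deal of time to fix. This post records how we resolved it.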
Our services are now deployed on a K8s cluster, and we want to set up a monitoring service that automatically monitors the cluster's resources. This post has two parts. The first uses Prometheus to monitor the K8s cluster, so that when a node or pod has a problem an alert is sent promptly and our deployers can fix it as soon as possible. The second is a simple Shell script that tests the cluster's network.
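All of our services currently run on a K8s cluster, and we want to build a monitoring service that automatically monitors the cluster's resources. This post has two sections. One uses Prometheus to watch the K8s cluster and raise an alert as soon as a node or container becomes abnormal, so that problems are found and fixed quickly. The other presents a simple Shell script for testing the cluster network.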
We currently use DolphinScheduler 2.0.5 to build our data platform. To improve the platform's stability and find bugs as soon as possible, we want to introduce E2E tests into our project. This post records the process of enabling the E2E tests and how we resolved the "Could not connect to Ryuk" problem.
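We are currently using DolphinScheduler 2.0.5 to build a big data scheduling platform. Because we want to improve system stability and catch bugs as early as possible, we want to introduce E2E testing into the project. This post records the process of enabling E2E and how we solved the "Could not connect to Ryuk" problem we ran into.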
Cisco Hangzhou’s Travel Through Apache DolphinScheduler Alert Module Refactor
After changing the post URLs to the form XXX.1.html, this post records how to automatically fill in an incrementing id when a new post is created.