I am trying to run my own algorithm container in Amazon SageMaker. At deployment time, I get the error below.

predictor = tree.deploy(1, 'ml.m4.xlarge', serializer=csv_serializer)

ValueError: Error hosting endpoint decision-trees-sample-2018-03-01-09-59-06-832: Failed Reason:  The primary container for production variant AllTraffic did not pass the ping health check.


When I then ran the same line of code again, I got the error below instead.

ClientError: An error occurred (ValidationException) when calling the CreateEndpoint operation: Cannot create already existing endpoint "arn:aws:sagemaker:us-east-1:69759707XXxXX:endpoint/decision-trees-sample-2018-03-01-09-59-06-832".



@djarpin wrote:

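The ping health check message is a general error that can be caused by several different problems. Usually, the error messages in the CloudWatch log group named /aws/sagemaker/Endpoints/ describe in more detail why the ping health check did not pass.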
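
If it helps, here is a minimal sketch of pulling those log messages with boto3 instead of clicking through the console. It is not from the original answer: the helper name endpoint_log_messages and the max_events default are made up for illustration, the region is taken from the ARN in the question, and only the first page of groups and events is read. It lists every log group under the prefix mentioned above rather than guessing a full group name.

```python
LOG_GROUP_PREFIX = "/aws/sagemaker/Endpoints/"  # prefix quoted in the answer above


def endpoint_log_messages(logs_client, prefix=LOG_GROUP_PREFIX, max_events=50):
    """Return {log_group_name: [message, ...]} for log groups under the prefix.

    Hypothetical helper: reads only the first page of groups and of events.
    """
    messages = {}
    groups = logs_client.describe_log_groups(logGroupNamePrefix=prefix)
    for group in groups.get("logGroups", []):
        name = group["logGroupName"]
        response = logs_client.filter_log_events(logGroupName=name, limit=max_events)
        messages[name] = [event["message"] for event in response.get("events", [])]
    return messages


def main():
    # boto3 is imported here so the helper above can be tried without AWS access.
    import boto3

    # Assumption: us-east-1, as in the endpoint ARN from the question.
    logs = boto3.client("logs", region_name="us-east-1")
    for group, lines in endpoint_log_messages(logs).items():
        print(group)
        for line in lines:
            print("   ", line.rstrip())
```

Calling main() with AWS credentials configured should print whatever the container wrote to stdout/stderr while starting up, which is usually where the real cause of the failed ping shows up.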

