Azure ML web服务超时
本文关键字:超时 服务 web ML Azure | 更新日期: 2023-09-27 18:00:30
我在Azure ML中创建了一个简单的实验,并用http客户端触发它。在Azure ML工作区中,执行时一切正常。然而,当我使用http客户端触发实验时,实验超时并失败。为http客户端设置超时值似乎不起作用。
我们有没有办法设置这个超时值,这样实验就不会失败?
确保您正确设置了客户端超时值。如果为web服务供电的服务器超时,那么它将发回一个HTTP状态代码为504 BackendScoreTimeout
(或可能为409 GatewayTimeout
)的响应。然而,如果你根本没有收到回复,那么你的客户等待的时间不够长。
通过在ML Studio中运行实验,您可以找到大量的时间。转到实验属性,了解它运行了多长时间,然后将大约两倍的时间作为超时值。
我在作为web服务发布的Azure ML实验中也遇到过类似的问题。大多数情况下,它运行正常,而有时返回时会出现超时错误。问题是实验本身有一个90秒的运行时间限制。所以,很可能你的实验运行时间超过了这个限制,并返回超时错误。hth
似乎无法根据截至2018年4月1日仍标记为"计划"的功能请求设置此超时。
MSDN论坛从2017年开始建议使用批量执行服务,该服务启动机器学习实验,然后异步询问是否完成。
以下是来自Azure ML Web服务管理示例代码的代码片段(所有注释都来自其示例代码):
using (HttpClient client = new HttpClient())
{
var request = new BatchExecutionRequest()
{
Outputs = new Dictionary<string, AzureBlobDataReference> () {
{
"output",
new AzureBlobDataReference()
{
ConnectionString = storageConnectionString,
RelativeLocation = string.Format("{0}/outputresults.file_extension", StorageContainerName) /*Replace this with the location you would like to use for your output file, and valid file extension (usually .csv for scoring results, or .ilearner for trained models)*/
}
},
},
GlobalParameters = new Dictionary<string, string>() {
}
};
client.DefaultRequestHeaders.Authorization = new AuthenticationHeaderValue("Bearer", apiKey);
// WARNING: The 'await' statement below can result in a deadlock
// if you are calling this code from the UI thread of an ASP.Net application.
// One way to address this would be to call ConfigureAwait(false)
// so that the execution does not attempt to resume on the original context.
// For instance, replace code such as:
// result = await DoSomeTask()
// with the following:
// result = await DoSomeTask().ConfigureAwait(false)
Console.WriteLine("Submitting the job...");
// submit the job
var response = await client.PostAsJsonAsync(BaseUrl + "?api-version=2.0", request);
if (!response.IsSuccessStatusCode)
{
await WriteFailedResponse(response);
return;
}
string jobId = await response.Content.ReadAsAsync<string>();
Console.WriteLine(string.Format("Job ID: {0}", jobId));
// start the job
Console.WriteLine("Starting the job...");
response = await client.PostAsync(BaseUrl + "/" + jobId + "/start?api-version=2.0", null);
if (!response.IsSuccessStatusCode)
{
await WriteFailedResponse(response);
return;
}
string jobLocation = BaseUrl + "/" + jobId + "?api-version=2.0";
Stopwatch watch = Stopwatch.StartNew();
bool done = false;
while (!done)
{
Console.WriteLine("Checking the job status...");
response = await client.GetAsync(jobLocation);
if (!response.IsSuccessStatusCode)
{
await WriteFailedResponse(response);
return;
}
BatchScoreStatus status = await response.Content.ReadAsAsync<BatchScoreStatus>();
if (watch.ElapsedMilliseconds > TimeOutInMilliseconds)
{
done = true;
Console.WriteLine(string.Format("Timed out. Deleting job {0} ...", jobId));
await client.DeleteAsync(jobLocation);
}
switch (status.StatusCode) {
case BatchScoreStatusCode.NotStarted:
Console.WriteLine(string.Format("Job {0} not yet started...", jobId));
break;
case BatchScoreStatusCode.Running:
Console.WriteLine(string.Format("Job {0} running...", jobId));
break;
case BatchScoreStatusCode.Failed:
Console.WriteLine(string.Format("Job {0} failed!", jobId));
Console.WriteLine(string.Format("Error details: {0}", status.Details));
done = true;
break;
case BatchScoreStatusCode.Cancelled:
Console.WriteLine(string.Format("Job {0} cancelled!", jobId));
done = true;
break;
case BatchScoreStatusCode.Finished:
done = true;
Console.WriteLine(string.Format("Job {0} finished!", jobId));
ProcessResults(status);
break;
}
if (!done) {
Thread.Sleep(1000); // Wait one second
}
}
}