Azure ML web服务超时

本文关键字:超时 服务 web ML Azure | 更新日期: 2023-09-27 18:00:30

我在Azure ML中创建了一个简单的实验,并用http客户端触发它。在Azure ML工作区中,执行时一切正常。然而,当我使用http客户端触发实验时,实验超时并失败。为http客户端设置超时值似乎不起作用。

我们有没有办法设置这个超时值,这样实验就不会失败?

Azure ML web服务超时

确保您正确设置了客户端超时值。如果为web服务供电的服务器超时,那么它将发回一个HTTP状态代码为504 BackendScoreTimeout(或可能为409 GatewayTimeout)的响应。然而,如果你根本没有收到回复,那么你的客户等待的时间不够长。

通过在ML Studio中运行实验,您可以找到大量的时间。转到实验属性,了解它运行了多长时间,然后将大约两倍的时间作为超时值。

我在作为web服务发布的Azure ML实验中也遇到过类似的问题。大多数情况下,它运行正常,而有时返回时会出现超时错误。问题是实验本身有一个90秒的运行时间限制。所以,很可能你的实验运行时间超过了这个限制,并返回超时错误。hth

似乎无法根据截至2018年4月1日仍标记为"计划"的功能请求设置此超时。

MSDN论坛从2017年开始建议使用批量执行服务,该服务启动机器学习实验,然后异步询问是否完成。

以下是来自Azure ML Web服务管理示例代码的代码片段(所有注释都来自其示例代码):

        using (HttpClient client = new HttpClient())
        {
            var request = new BatchExecutionRequest()
            {
                Outputs = new Dictionary<string, AzureBlobDataReference> () {
                    {
                        "output",
                        new AzureBlobDataReference()
                        {
                            ConnectionString = storageConnectionString,
                            RelativeLocation = string.Format("{0}/outputresults.file_extension", StorageContainerName) /*Replace this with the location you would like to use for your output file, and valid file extension (usually .csv for scoring results, or .ilearner for trained models)*/
                        }
                    },
                },    
                GlobalParameters = new Dictionary<string, string>() {
                }
            };
            client.DefaultRequestHeaders.Authorization = new AuthenticationHeaderValue("Bearer", apiKey);
            // WARNING: The 'await' statement below can result in a deadlock
            // if you are calling this code from the UI thread of an ASP.Net application.
            // One way to address this would be to call ConfigureAwait(false)
            // so that the execution does not attempt to resume on the original context.
            // For instance, replace code such as:
            //      result = await DoSomeTask()
            // with the following:
            //      result = await DoSomeTask().ConfigureAwait(false)
            Console.WriteLine("Submitting the job...");
            // submit the job
            var response = await client.PostAsJsonAsync(BaseUrl + "?api-version=2.0", request);
            if (!response.IsSuccessStatusCode)
            {
                await WriteFailedResponse(response);
                return;
            }
            string jobId = await response.Content.ReadAsAsync<string>();
            Console.WriteLine(string.Format("Job ID: {0}", jobId));
            // start the job
            Console.WriteLine("Starting the job...");
            response = await client.PostAsync(BaseUrl + "/" + jobId + "/start?api-version=2.0", null);
            if (!response.IsSuccessStatusCode)
            {
                await WriteFailedResponse(response);
                return;
            }
            string jobLocation = BaseUrl + "/" + jobId + "?api-version=2.0";
            Stopwatch watch = Stopwatch.StartNew();
            bool done = false;
            while (!done)
            {
                Console.WriteLine("Checking the job status...");
                response = await client.GetAsync(jobLocation);
                if (!response.IsSuccessStatusCode)
                {
                    await WriteFailedResponse(response);
                    return;
                }
                BatchScoreStatus status = await response.Content.ReadAsAsync<BatchScoreStatus>();
                if (watch.ElapsedMilliseconds > TimeOutInMilliseconds)
                {
                    done = true;
                    Console.WriteLine(string.Format("Timed out. Deleting job {0} ...", jobId));
                    await client.DeleteAsync(jobLocation);
                }
                switch (status.StatusCode) {
                    case BatchScoreStatusCode.NotStarted:
                        Console.WriteLine(string.Format("Job {0} not yet started...", jobId));
                        break;
                    case BatchScoreStatusCode.Running:
                        Console.WriteLine(string.Format("Job {0} running...", jobId));
                        break;
                    case BatchScoreStatusCode.Failed:
                        Console.WriteLine(string.Format("Job {0} failed!", jobId));
                        Console.WriteLine(string.Format("Error details: {0}", status.Details));
                        done = true;
                        break;
                    case BatchScoreStatusCode.Cancelled:
                        Console.WriteLine(string.Format("Job {0} cancelled!", jobId));
                        done = true;
                        break;
                    case BatchScoreStatusCode.Finished:
                        done = true;
                        Console.WriteLine(string.Format("Job {0} finished!", jobId));
                        ProcessResults(status);
                        break;
                }
                if (!done) {
                    Thread.Sleep(1000); // Wait one second
                }
            }
        }