QACO: Exploiting partial execution in web servers

Jinhan Kim, Sameh Elnikety, Yuxiong He, Seung Won Hwang, Shaolei Ren

Research output: Chapter in Book/Report/Conference proceedingConference contribution

4 Citations (Scopus)

Abstract

Web servers provide content to users, with the requirement of providing high response quality within a short response time. Meeting these requirements is challenging, especially in the event of load spikes. Meanwhile, we observe that a response to a request can be adapted or partially executed depending on current resource availability at the server. For example, a web server can choose to send a low or medium resolution image instead of sending the original high resolution image under resource contention. In this paper, we exploit partial execution to expose a trade off between resource consumption and service quality. We show how to manage server resources to improve service quality and responsiveness. Specifically, we develop a framework, called Quota-based Control Optimization (QACO). The quota represents the total amount of resources available for all pending requests. QACO consists of two modules: (1) A control module adjusts the quota to meet the response time target. (2) An optimization module exploits partial execution and allocates the quota to pending requests in a manner that improves total response quality. We evaluate the framework using a system implementation in the Apache Web server, and using a simulation study of a Video-on-Demand server. The results show that under a response time target, QACO achieves a higher response quality than traditional techniques that admit or reject requests without exploiting partial execution.

Original languageEnglish
Title of host publicationProceedings of the 2013 ACM Cloud and Autonomic Computing Conference, CAC 2013
DOIs
Publication statusPublished - 2013 Sep 16
Event2013 ACM International Conference on Cloud and Autonomic Computing, CAC 2013 - Miami, FL, United States
Duration: 2013 Aug 52013 Aug 9

Publication series

NameACM International Conference Proceeding Series

Other

Other2013 ACM International Conference on Cloud and Autonomic Computing, CAC 2013
CountryUnited States
CityMiami, FL
Period13/8/513/8/9

Fingerprint

Servers
Image resolution
Response time (computer systems)
Video on demand
Availability

All Science Journal Classification (ASJC) codes

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications

Cite this

Kim, J., Elnikety, S., He, Y., Hwang, S. W., & Ren, S. (2013). QACO: Exploiting partial execution in web servers. In Proceedings of the 2013 ACM Cloud and Autonomic Computing Conference, CAC 2013 (ACM International Conference Proceeding Series). https://doi.org/10.1145/2494621.2494636
Kim, Jinhan ; Elnikety, Sameh ; He, Yuxiong ; Hwang, Seung Won ; Ren, Shaolei. / QACO : Exploiting partial execution in web servers. Proceedings of the 2013 ACM Cloud and Autonomic Computing Conference, CAC 2013. 2013. (ACM International Conference Proceeding Series).
@inproceedings{11e8e9eda9ee4fd6b197c2750d24ef88,
title = "QACO: Exploiting partial execution in web servers",
abstract = "Web servers provide content to users, with the requirement of providing high response quality within a short response time. Meeting these requirements is challenging, especially in the event of load spikes. Meanwhile, we observe that a response to a request can be adapted or partially executed depending on current resource availability at the server. For example, a web server can choose to send a low or medium resolution image instead of sending the original high resolution image under resource contention. In this paper, we exploit partial execution to expose a trade off between resource consumption and service quality. We show how to manage server resources to improve service quality and responsiveness. Specifically, we develop a framework, called Quota-based Control Optimization (QACO). The quota represents the total amount of resources available for all pending requests. QACO consists of two modules: (1) A control module adjusts the quota to meet the response time target. (2) An optimization module exploits partial execution and allocates the quota to pending requests in a manner that improves total response quality. We evaluate the framework using a system implementation in the Apache Web server, and using a simulation study of a Video-on-Demand server. The results show that under a response time target, QACO achieves a higher response quality than traditional techniques that admit or reject requests without exploiting partial execution.",
author = "Jinhan Kim and Sameh Elnikety and Yuxiong He and Hwang, {Seung Won} and Shaolei Ren",
year = "2013",
month = "9",
day = "16",
doi = "10.1145/2494621.2494636",
language = "English",
isbn = "9781450321723",
series = "ACM International Conference Proceeding Series",
booktitle = "Proceedings of the 2013 ACM Cloud and Autonomic Computing Conference, CAC 2013",

}

Kim, J, Elnikety, S, He, Y, Hwang, SW & Ren, S 2013, QACO: Exploiting partial execution in web servers. in Proceedings of the 2013 ACM Cloud and Autonomic Computing Conference, CAC 2013. ACM International Conference Proceeding Series, 2013 ACM International Conference on Cloud and Autonomic Computing, CAC 2013, Miami, FL, United States, 13/8/5. https://doi.org/10.1145/2494621.2494636

QACO : Exploiting partial execution in web servers. / Kim, Jinhan; Elnikety, Sameh; He, Yuxiong; Hwang, Seung Won; Ren, Shaolei.

Proceedings of the 2013 ACM Cloud and Autonomic Computing Conference, CAC 2013. 2013. (ACM International Conference Proceeding Series).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - QACO

T2 - Exploiting partial execution in web servers

AU - Kim, Jinhan

AU - Elnikety, Sameh

AU - He, Yuxiong

AU - Hwang, Seung Won

AU - Ren, Shaolei

PY - 2013/9/16

Y1 - 2013/9/16

N2 - Web servers provide content to users, with the requirement of providing high response quality within a short response time. Meeting these requirements is challenging, especially in the event of load spikes. Meanwhile, we observe that a response to a request can be adapted or partially executed depending on current resource availability at the server. For example, a web server can choose to send a low or medium resolution image instead of sending the original high resolution image under resource contention. In this paper, we exploit partial execution to expose a trade off between resource consumption and service quality. We show how to manage server resources to improve service quality and responsiveness. Specifically, we develop a framework, called Quota-based Control Optimization (QACO). The quota represents the total amount of resources available for all pending requests. QACO consists of two modules: (1) A control module adjusts the quota to meet the response time target. (2) An optimization module exploits partial execution and allocates the quota to pending requests in a manner that improves total response quality. We evaluate the framework using a system implementation in the Apache Web server, and using a simulation study of a Video-on-Demand server. The results show that under a response time target, QACO achieves a higher response quality than traditional techniques that admit or reject requests without exploiting partial execution.

AB - Web servers provide content to users, with the requirement of providing high response quality within a short response time. Meeting these requirements is challenging, especially in the event of load spikes. Meanwhile, we observe that a response to a request can be adapted or partially executed depending on current resource availability at the server. For example, a web server can choose to send a low or medium resolution image instead of sending the original high resolution image under resource contention. In this paper, we exploit partial execution to expose a trade off between resource consumption and service quality. We show how to manage server resources to improve service quality and responsiveness. Specifically, we develop a framework, called Quota-based Control Optimization (QACO). The quota represents the total amount of resources available for all pending requests. QACO consists of two modules: (1) A control module adjusts the quota to meet the response time target. (2) An optimization module exploits partial execution and allocates the quota to pending requests in a manner that improves total response quality. We evaluate the framework using a system implementation in the Apache Web server, and using a simulation study of a Video-on-Demand server. The results show that under a response time target, QACO achieves a higher response quality than traditional techniques that admit or reject requests without exploiting partial execution.

UR - http://www.scopus.com/inward/record.url?scp=84883738488&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84883738488&partnerID=8YFLogxK

U2 - 10.1145/2494621.2494636

DO - 10.1145/2494621.2494636

M3 - Conference contribution

AN - SCOPUS:84883738488

SN - 9781450321723

T3 - ACM International Conference Proceeding Series

BT - Proceedings of the 2013 ACM Cloud and Autonomic Computing Conference, CAC 2013

ER -

Kim J, Elnikety S, He Y, Hwang SW, Ren S. QACO: Exploiting partial execution in web servers. In Proceedings of the 2013 ACM Cloud and Autonomic Computing Conference, CAC 2013. 2013. (ACM International Conference Proceeding Series). https://doi.org/10.1145/2494621.2494636