SHORT PAPER FAST AND FAIR HANDLING OF MULTIMEDIA CAPTCHA FLOWS Fast and Fair Handling of Multimedia CAPTCHA Flows http://dx.doi.org/10.3991/ijim.v9i4.4644 A.N. Tsioliaridou1, C. Zhang2 and C.K. Liaskos3 1Democritus University of Thrace, Xanthi, Greece. 2Florida International University, USA. 3Aristotle University, Thessaloniki, Greece. Abstract—Multimedia CAPTCHAs play a crucial role in bot filtering policies for VoIP applications. Network flows carrying CAPTCHA content have a very small lifespan and require high transmission quality, making for special handling at transport layer. The present study introduces an analysis-derived, rapidly converging control mechanism for CAPTCHA flows. Instead of relying on generic heuristics, the novel scheme enables the flows to estimate their deviation from the equilibrium and adapt to it in a single step. Simulations demonstrate the high efficiency and convergence speed of the scheme, highlighting its unique fitness for the appointed task. Index Terms—Network congestion control, Fairness, Convergence speed, CAPTCHA application. I. INTRODUCTION Voice over IP services are envisioned as the future of telephony and online communications. However, thinning the borders between computer networks and telephony has its risks. Modern VoIP networks have to account for “bots”, i.e. pieces of software that perform automated calls for purposes ranging from advertisement to denial of service [1]. Call !ltering policies are employed in retaliation. An integral part of these policies are the CAPTCHAs, a form of reverse Turing test which is administered as a means of proving that the caller is indeed human. CAPTCHAs generally come in multimedia form and contain simple puzzles to be solved, involving text, video and sound recognition in the process [2]. The proper management of CAPTCHA "ows at transport layer is the focus of the present study. Performing CAPTCHA tests on users decreases their quality of experience. This situation can be aggravated further under presence of network issues. Audio/video problems on the received content could result into test failure, leading to caller blacklisting, blocking or banning, as per the enforced policy. Long delays in reception are not acceptable either, since the tests are administered with time restrictions. Using an error control-capable transport protocol (TCP variant) is in general mandatory. Thus, the problem can be reduced to proper bandwidth allocation for the CAPTCHA-conveying network "ows. However, a unique attribute of a CAPTCHA "ow is its small size. The actual media content typically varies from a few KBytes to a few MBytes. Thus, a desirable attribute of a proper "ow management scheme is the fast convergence to equilibrium. The coexistence with "ows with larger lifespans (e.g. FTP, streaming) could pose an issue [3]. A commonly used approach is then to employ separate router buffers for the short-lived "ows [4]. The same outcome can also be accomplished by placing the "ows into virtual dedicated channels [5]. Nonetheless, router buffer allocations, or dedicated bandwidth, are a scarce and valuable resource. The corresponding CAPTCHA "ows will eventually saturate the available buffers, causing congestion. The present work offers a novel way of CAPTCHA "ow management at transport layer. The study is motivated by the results of [6], which shows that the delay and jitter per "ow are minimized when all "ows are assigned their fair share of the bandwidth. A mechanism is presented that allows each "ow independently to estimate its deviation from its fair- share. More particularly, each "ow becomes aware of whether it has operated beyond, below or around its fair- share, which allows them to determine the next congestion control strategy for fast convergence to fairness. We then introduce the necessary window adjustment rules. According to the Fair-Share rule each "ow can estimate and rapidly operate at its fair-share. Simulation results con!rm that, in the presence of the Fair-Share rule, the system converges in one congestion cycle (one epoch) to equilibrium. II. ANALYSIS We assume the typical bottleneck topology of Fig. 1. A number ! ! !!!!of "ows compete for the link and adjust their congestion windows, !!!!!, following the Additive Increase-Multiplicative Decrease rule [7]. The system model is characterized by synchronous "ow noti!cation for congestion events, which is a classic assumption as well [8]. We de!ne the instantaneous Throughput (!) of the !!! "ow at time t as: ( ) ( ) ( ) ( ) ( ) i i i o d w t w t T t R t R q t = = + (1) where R(t) is the time-variant round trip time, comprising the queuing delay !!!!! and a constant factor Ro representing physical attributes: ( )sink2o src bR d d d= + + (2) The flow windows undergo the following repeating procedure. Initially, all flows are in the additive increase stage. At the time the bandwidth is exhausted and the buffer has overflowed (cliff point) the network signals the senders to decrease their data rate. Consider a flow i that competes for the bwb bandwidth. After a successful delivery of a packet, it receives the relevant ACK from the receiver. Consider two ACKs that it receives between the system knee and cliff points [8], one at time t1 and another at time t2. The throughput of the ith flow at time t1 and t2 are Ti(t1) and Ti(t2) respectively, and !!!!!!! is: 64 http://www.i-jim.org SHORT PAPER FAST AND FAIR HANDLING OF MULTIMEDIA CAPTCHA FLOWS Figure 1. A typical model of n users sharing a link. bw(*) and d(*) denote the bandwidth and delay respectively. BS is the buffer size at the router. 1 2 1 2 2 1 2 1 ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) i i i i i i w t w t w t T t T t T t R t R t +! ! = " = " = 2 1 1 2 1 ( ) ( ) ( ) [ ( )][ ( )] i i q o q o q w t R t w t d R d t R d t ! " ! + + (3) where !!! is defined as: 2 1 2 2 1 ( ) ( ) ( ) ( ) n k k q q q b w t d t d t d t bw = ! " #$ % & '# = ( = ) (4) Consequently from (3) and (4): 2 1 2 1 1 2 2 1 ( ) ( ) ( ) ( ) ( ) [ ( )][ ( )] n k k i i i o q o q w t w t R t w t bwT t R d t R d t = ! " #$ % & '# ( # = = + + ) 2 1 1 2 1 2 1 ( ) ( ) ( ) ( ) [ ( )][ ( )] n i i k k o q o q w t R t bw w t w t R d t R d t bw = ! " # $ #% & ' ( + + ) (5) Remark 1. If network resources were allocated equally among competing flows, they would all experience the same increase of their congestion window, and equal to !!!!!!! during the time period from t1 to t2, defined as: 2 2 1 ( ) ( ) n k i k w t n w t = ! " # = #$ % & ' ( (6) Due to this remark, equation (5) becomes: 2 1 1 2 2 2 1 ( ) ( ) ( )! ! ( ) ( ) [ ( )][ ( )] i i i i o q o q w t R t bw w t n w t T t R d t R d t bw ! " ! ! = = + + 2 1 2 1 ( ) ! ( ) [1 ] [ ( )] ( )! i i o q w t nw t R d t R t bw ! " # + (7) 2 1 2 2 1 1 ( ) ! ( ) ( ) [1 ] [ ( )] ( ) i i i n o q k k w t nw t T t R d t w t = ! ! = " + # (8) Note that the term !!!!!!! !!! ! !!!!!!! in (8) is always positive, while the term ! ! ! ! !!!!!! !!!!!! ! !!! might be either positive or negative. Corollary 2. Equation (8) enables the ith flow to determine whether it operates below, beyond or at its fair- share as follows: Figure 2. Flow operation above (left) and below (right) its fair-share. • If !T(t2)<0, the i th flow at time t1 has reached a rate above its fair-share, since 1 1 1 ( ) ! ( ) n k i k w t nw t = 0, the rate of i th flow at time t1 is below its fair-share, since 1 1 1 ( ) ! ( ) n k i k w t nw t = >! . • If !T(t2)=0, the i th flow has reached its fair-share, since 1 1 1 ( ) ! ( ) n k i k w t nw t = =! . Therefore, equation (8) establishes a criterion, based on which each flow independently can review its transmission rate and deduce whether it operates greedily, fairly or suffering mistreatment. Theorem 3. In order to operate at its fair-share, the flow should adjust its congestion window, according to equation: !Bfair share o B w u w R R! + = (9) where A B B A B A R w R wu R R ! = ! . Proof. We examine the cases defined by Corollary 2. Notations A and B represent two measurement samples during the jth epoch at times tA and tB, where tARo. 1st case (operation above the fair-share, left part of Fig. 2). Consider two times, tA and tB during the j th epoch, where tknee