04.07.2013 Views

Thèse de doctorat: Algorithmes de classification répartis sur le cloud

Thèse de doctorat: Algorithmes de classification répartis sur le cloud

Thèse de doctorat: Algorithmes de classification répartis sur le cloud

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

tel-00744768, version 1 - 23 Oct 2012<br />

6.4. VQ PARALLELIZATION SCHEME 131<br />

towards critical points are guaranteed.<br />

The results are gathered with different values of τ (τ ∈ {1, 10, 100}) in the<br />

charts displayed in Figure 6.6. For each chart, we plot the performance curves<br />

when the distributed architecture has a different number of computing instances:<br />

M ∈ {1, 2, 10}. The curve of reference is the sequential execution of the VQ,<br />

that is M = 1. We can notice that for every value of τ ∈ {1, 10, 100}, multip<strong>le</strong><br />

resources do not bring speedup for convergence. Even if more data are processed,<br />

there is no increase in the convergence speed. To conclu<strong>de</strong>, no gain in terms of<br />

wall time can be brought using this paral<strong>le</strong>l scheme. In the next subsection we<br />

investigate the cause of these non-satisfactory results and propose a solution to<br />

overcome this prob<strong>le</strong>m.<br />

6.4.2 Towards a better paral<strong>le</strong>lization scheme<br />

Let us start the investigation of the paral<strong>le</strong>l scheme given by iterations (6.6)<br />

by rewriting the sequential VQ iterations (6.2) using the notation introduced in<br />

Section 6.1. For t ≥ τ it holds<br />

t<br />

w(t + 1) = w(t − τ + 1) − εt ′ +1H z{t ′ +1 mod n}, w(t ′ ) . (6.7)<br />

t ′ =t−τ+1<br />

For iterations (6.6), consi<strong>de</strong>r a time slot t ≥ 0 where an averaging phase occurs,<br />

that is, t mod τ = 0 and t > 0. Then, for all i ∈ {1, . . . , M},<br />

w i (t + 1) = w i t<br />

(t − τ + 1) − εt ′ <br />

M 1<br />

+1 H<br />

M<br />

z j<br />

t ′ +1 , wj (t ′ ) <br />

. (6.8)<br />

t ′ =t−τ+1<br />

To the empirical distortion <strong>de</strong>fined by equation (6.5) is associated its theoretical<br />

counterpart the distortion function C (see for examp<strong>le</strong> [98] or [31]). In the<br />

first situation of iterations (6.7), for a samp<strong>le</strong> z and t ′ > 0, H(z, w(t ′ )) is an<br />

observation and an estimator of the gradient ∇C(w(t ′ )) (see e.g. [95]). In the<br />

second situation of iterations (6.8), <strong>le</strong>t us assume that the multip<strong>le</strong> versions are<br />

close to each other. This means that w j (t ′ ) ≈ w i (t ′ ), for all (i, j) ∈ {1, . . . , M} 2 .<br />

Thus, the average<br />

1<br />

M<br />

M<br />

H<br />

j=1<br />

j=1<br />

<br />

z j<br />

{t ′ +1 mod n} , wj (t ′ <br />

)<br />

can also be viewed as an estimation of the gradient ∇C(w i (t ′ )), where i ∈<br />

{1, . . . , M}.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!