31.10.2012 Views

NON-ADDICTIVE SAMPLE BASED FACTS - Deetc

NON-ADDICTIVE SAMPLE BASED FACTS - Deetc

NON-ADDICTIVE SAMPLE BASED FACTS - Deetc

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

code to implement one of the report. In the particular<br />

case, the query execution display the rating, for the<br />

main targets, by time periods, for one specific channel.<br />

WITH MEMBER Measures . TargetTotal as<br />

’ LookupCube (" ContactTarget ",<br />

"( Measures . weight ,"+ membertostr (<br />

Target . currentmember )+")") ’<br />

MEMBER Measures . rat as<br />

’ Measures . weight / Measures . TargetTotal<br />

/ Measures . num_min ’<br />

, FORMAT_STRING =’ Percent ’<br />

SELECT { Target . currentmember } ON COLUMNS ,<br />

time . Period . Members ON ROWS<br />

FROM AudienceTarget<br />

WHERE ( Measures .rat , Program . Canal .[2])<br />

Listing 1: The rating for the main targets by time periods<br />

for channel 2 using the target aggregate<br />

The code is rather simple because each target<br />

weight is pre-calculated in the ContactTarget aggregate.<br />

The LookupCube function lookup the value for<br />

each target querying it. With the absence of this cube,<br />

each target weight is calculate in runtime, looking up<br />

each socio-demographic variable that made up the target.<br />

Not only the execution times rise up, but also the<br />

query’ code.<br />

4 Conclusion<br />

This work presents a dimensional model capable<br />

to address the specificities of quota sample based data.<br />

The goal was twofold; first, to demonstrate that is<br />

possible to address audience analysis requirements,<br />

using non-proprietary repositories and technologies;<br />

second, to present a possible solution to other domains<br />

where data is also quota sample based. In the<br />

audience domain, the data tendencies are corrected by<br />

a daily individual weight, that must be taken into account<br />

if the indicators are meant to be representative<br />

to the entire population, not just the panel’s individuals.<br />

To deal with the audience performance indicators,<br />

mostly non-addictive and quota depended, is necessary<br />

to normalise each one with a reference value, calculated<br />

from the corrective daily weights. The present<br />

solution lay on the creation of an auxiliary contact table<br />

to store daily weights for each possible combination<br />

of socio-demographis values. The authors transform<br />

this way the non-addictive facts into addictive<br />

ones, sacrificing the capability of pre-calculated their<br />

values and store them into a fact table directly. All of<br />

the calculus must be done in runtime. To ensure an<br />

efficient solution, is necessary to create a series of domain<br />

dependant aggregates, with specific dimension<br />

models. The performance indicators test results, using<br />

both proprietary program and generic OLAP tools<br />

with the discussed model, have matched.<br />

Authors think the same methodology is appropriate<br />

to other domains if the data is quota sample based<br />

and the performance indicators values are always relative<br />

to the subset of the sample used in their calculus.<br />

REFERENCES<br />

[1] Yvonne M. M. Bishop, E. F. Fienberg, and P. W. Holland.<br />

Discrete multivariate analysis : theory and practice.<br />

The MIT Press, 1975.<br />

[2] EF Codd, SB Codd, and CT Sally. Providing<br />

OLAP to user-analysis. Technical report,<br />

http://www.arborsoft.com/essbase/wht ppr/coddps.<br />

zip, 1993.<br />

[3] Nuno Datia. Aplicação de técnicas de apoio à decisão<br />

a dados de audimetria. Master’s thesis, Faculdade de<br />

Ciências e Tecnologia - Universidade Nova de Lisboa,<br />

2006.<br />

[4] W. Edwards Deming and Frederick F. Stephan. On a<br />

least squares adjustment of a sampled frequency table<br />

when the expected marginal totals are known. Annals<br />

of Mathematical Statistics, 11(4):427–444, 1940.<br />

[5] C. Imhoff, N. Galemmo, and J.G. Geiger. Mastering<br />

Data Warehouse Design: Relational and Dimensional<br />

Techniques. Wiley, 2003.<br />

[6] WH Inmon. Building the data warehouse. John Wiley<br />

& Sons, Inc. New York, NY, USA, 2005.<br />

[7] N. Jukic, B. Jukic, and M. Malliaris. Online Analytical<br />

Processing (OLAP) for Decision Support. In<br />

Handbook on Decision Support Systems. Springer,<br />

2008.<br />

[8] R. Kimball, L. Reeves, M. Ross, and W. Thornthwaite.<br />

The Data Warehouse Lifecycle Toolkit. Wiley, 1998.<br />

[9] R. Kimball and M. Ross. The Data Warehouse Toolkit:<br />

The Complete Guide to Dimensional Modeling. Wiley,<br />

2002.<br />

[10] MediaSoft Kimono. http:// www. kubik. it/<br />

kimono_ en. html , last acessed on May 2008.<br />

[11] Sql Server Analysis Services. http://<br />

technet. microsoft. com/ pt-br/ sqlserver/<br />

bb671220(en-us). aspx , last acessed on May 2008.<br />

[12] Sql Server Data TRansformations Services. http://<br />

www. microsoft. com/ technet/ prodtechnol/ sql/<br />

2000/ deploy/ dtssql2k. mspx , last acessed on May<br />

2008.<br />

[13] Markdata Telereport. http:// www. markdata. net/<br />

v2/ , last acessed on May 2008.<br />

[14] Rene Weber. Methods to Forecast Television Viewing<br />

Patterns for Target Audiences. In Communication Research<br />

in Europe and Abroad –Challenges of the First<br />

Decade, 2003.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!