
Forward Modelling of Resistivity Logs using Neural Networks

P. Demmenie

August 31, 1995


Abstract

In the oil industry, one uses the difference in electrical conductivity between hydrocarbons (oil and gas) and water to determine the hydrocarbon saturation of rocks. To this end, a measuring device is lowered in a wellbore to record a so-called resistivity log. Due to environmental effects this resistivity log differs from the resistivity of the formation, the true resistivity. To invert the measured log to the true resistivity one uses an iterative forward modelling process, involving the numerical solution of differential equations. Although the current modelling algorithms have improved significantly in speed compared to a few years ago, they are still not fast enough for inversion on a wellsite. Therefore, we have investigated the feasibility of using neural networks to perform the forward modelling process. Once trained, neural networks are very fast in producing output to certain input.

The one-way mapping between the earth models (true resistivity model and environmental conditions) and the tool response can be learned by a "standard" fully connected net. However, problems arise from the high number of inputs that is needed to represent the earth model (450) and the high number of connections between the input layer and the hidden layer of the net (6750). The generalization performance of these nets is not sufficient for the purpose of resistivity log inversion.

We have studied two preprocessing methods to reduce the number of inputs: principal component analysis and the wavelet transform. This preprocessing of the input is quite successful. We have also studied methods to reduce the number of connections in the net by using locally connected nets instead of fully connected nets. We have found an architecture which proves to be quite efficient for this mapping: a convolutional-regression net. This type of net is based on the locally connected nets and shared weights. Due to these shared weights the hidden layer performs a convolution of the input layer. The convolution kernel, the set of shared weights, is learned by the net itself. The weight sharing reduces the number of connections in the net and these constraints improve the generalization ability of the net.

This convolutional-regression net is approximately 100 times faster than the forward model that is used at KSEPL nowadays. The performance of the net is measured in the average relative error between the network output and the forward model output. The neural net can be used as a fast forward model when the average relative error lies below 5%. The convolutional-regression net achieves an accuracy of approximately 8% on a resistivity log coming from a "real" oil well. Further improvements in accuracy can be achieved by using a more representative training set. Even with less accuracy the neural net could be used as an initial start for the forward modelling process.



Preface

This report is the master thesis of Pamela Demmenie for graduating from the Rijksuniversiteit Leiden (RUL) at the department of Computer Science. It describes a project that was performed at the Koninklijke Shell Exploratie en Produktie Laboratorium (KSEPL) in Rijswijk from December 1994 until August 1995. The project was supervised by Dr. Guozhong An (KSEPL) and Dr. Ida Sprinkhuizen-Kuyper (RUL).

The report is divided into six chapters. The first chapter provides a background on forward modelling and a short introduction in neural networks. The second chapter describes all the methods we have used in the experiments. These methods involve input reductions and architecture constraints. Chapters 3 and 4 describe the actual experiments for data without and with invasion. In chapter 5 we present the results of the trained neural networks on realistic logging data and in chapter 6 we summarize the conclusions of this project.

All neural net simulations were run on Sparc 10 and Sparc 20 (Unix) stations and should eventually be run on an IBM RS/6000 workstation (the approximation times were measured on this type of workstation). We used the Xerion simulator, versions 3.1 and 4.0. We have also experimented with other simulators: Stuttgart Neural Network Simulator (SNNS) and Aspirin/MIGRAINES. The advantages and disadvantages of these network simulators are outlined in appendix A.

Acknowledgements

First of all I would like to thank my supervisors Guozhong An and Ida Sprinkhuizen-Kuyper for their support and ideas during this project. Of course the project would not have been completed without the help of Niels van Dijk and Leon Hoffman, who have provided the data that we have been working with and helped me in my understanding of the forward modelling process. I also like to thank the students from KSEPL and from the university for their support and company during the nine months I have been working on the project. And last but not least everlasting gratitude goes to Pim Bussink for word 2 on page 42.



Contents

Abstract
Preface

1 Forward Modelling of Resistivity Logs
1.1 Resistivity Logging
1.1.1 Logging tools
1.1.2 Environmental effects
1.2 Inversion by Forward Modelling
1.2.1 Inversion
1.2.2 Forward Modelling
1.3 Neural Networks
1.3.1 Network architecture
1.3.2 The Learning Method
1.3.3 The training set
1.3.4 Generalization
1.3.5 Local minima
1.4 Forward Modelling using a Neural Network

2 Input representation and architecture design
2.1 Input representation
2.1.1 Discretized sliding window
2.1.2 Attributes
2.1.3 Input and output scaling
2.2 Preprocessing
2.2.1 Principal Component Analysis
2.2.2 Wavelet transform
2.3 Architecture constraints
2.3.1 Fully connected nets
2.3.2 Locally connected nets
2.3.3 Symmetry constraints
2.3.4 Convolutional network
2.3.5 Time Delayed network
2.3.6 Convolutional-regression network
2.4 Error function

3 Forward modelling without mud invasion
3.1 Experimenting with different scaling methods
3.2 Experimenting with the network architecture
3.3 Experimenting with the size of the sliding window
3.4 Summary and results

4 Forward modelling with mud invasion
4.1 Scaling of the parameters
4.2 Experimenting with different input representations
4.3 Experimenting with input reduction methods
4.3.1 Using different sampling methods
4.3.2 Reducing the input by projection to principal components
4.3.3 Reducing the inputs by removing wavelet coefficients
4.4 Creating a more representative training set
4.5 Intermediate results input representations
4.6 Experimenting with architecture constraints
4.6.1 Experimenting with fully connected nets
4.6.2 Experimenting with locally connected nets
4.6.3 Using symmetry constraints
4.6.4 Experimenting with convolutional regression nets
4.7 Intermediate results architecture design
4.8 Summary and results

5 The neural network as fast forward model
5.1 Application to earth models without invasion
5.2 Application to earth models with invasion
5.3 Application to realistic earth model

6 Conclusions
6.1 Neural network as fast forward model?
6.2 Methods
6.2.1 Input representation
6.2.2 Input reduction
6.2.3 Architecture design
6.3 Application of the convolutional-regression net

A Neural network simulators


List of Tables

1 The tool response.
2 Parameters for earth models without invasion.
3 Average relative error for earth models without invasion (1).
4 Performance on earth models without invasion (1).
5 Parameters for earth models with invasion.
6 Comparing discretized sliding window against attributes representation.
7 Comparing uniform sampling methods with different sampling periods.
8 Comparing uniform and non-uniform sampling methods.
9 Comparing uniform sampled input without and with projection to principal components.
10 Comparing non-uniform sampled input without and with projection to principal components.
11 Comparing input representations with different number of wavelet coefficients (1).
12 Comparing input representations with different number of wavelet coefficients (2).
13 Comparing training set of target logs and training set of coarser sampled target logs.
14 Comparing training set of target logs and training set of difficult parts of target logs.
15 Comparing different fully connected nets.
16 Comparing different locally connected nets.
17 Comparing fully connected nets without and with symmetry constraints.
18 Comparing locally connected nets without and with symmetry constraints.
19 Comparing "wavelet" net without and with symmetry constraints.
20 Results for convolutional-regression nets (1).
21 Results for convolutional-regression nets (2).
22 Average relative error and performance for earth models with invasion (Shallow-log).
23 Average relative error and performance for earth models with and without invasion (Shallow-log).
24 Average relative error and performance for earth models with and without invasion (Deep-log).


List of Figures

1 Laboratory resistivity measuring apparatus with unguarded planar electrodes.
2 Modification of the laboratory apparatus using cylindrical electrodes instead of planar electrodes and the Dual-Laterolog.
3 Input to the Deep Laterolog (left) and the Shallow Laterolog (right). The Deep Laterolog gets information from a larger part of the formation.
4 Environmental effects: (A) ideal situation, (B) dipping formation, (C) caved boreholes, (D) deviated boreholes and (E) horizontal boreholes.
5 The inversion process and forward modelling of resistivity logs.
6 Transfer impedances.
7 A diagram of neuron j.
8 Local minima in the error surface.
9 A formation and its model.
10 Part of a model with input signal Rt and output signals Deep-log (LLd) and Shallow-log (LLs). The input signals Rxo and dxo are omitted.
11 Example of sliding window input representation. A sliding window of size w is placed around the point of interest along the input models.
12 When the sliding window is smaller than the largest bed in the model, the input to the net will be the same for the sketched situation, although the target differs a lot.
13 Standard normal distribution.
14 Most of the variation of the data lies in the direction of $\varepsilon_1$.
15 Haar wavelet from box function $W_{\text{Haar}}(x) = \Phi(2x) - \Phi(2x-1)$.
16 The input signal is written as a weighted combination of rectangular block functions. The coefficients $c_i$ are used as inputs to the neural net.
17 Boundaries outside the center of the window are not described accurately.
18 Fully connected (left) and locally connected (right) neural nets.
19 Receptive fields which overlap 1/3.
20 A discretized input signal and its mirror image.
21 Symmetry constraints on fully connected net.
22 Symmetry constraints on locally connected net. The receptive fields are constrained symmetrically (receptive field f is constrained to field F-f+1 for F fields).
23 Symmetry constraints on wavelet nets. The coefficients are constrained per detail level.
24 Convolutional network.
25 Time delayed neural network.
26 Convolutional-regression network.
27 The network and relative error for fixed proportions d/a.
28 Model that causes difficulties in approximating the Deep-log.
29 Response of the neural net to the difficult model shown above.
30 Small difference in input with attributes input representation.
31 Non-uniform sampling in sliding window.
32 Loss of information per variable for Test 6. Original input consists of 3 x 128 inputs, coming from a uniform sampled sliding window.
33 Loss of information per variable for Test 7. Original input consists of 3 x 64 inputs, coming from a non-uniform sampled sliding window.
34 Not all transitions are detected by wavelet coefficients.
35 Intermediate results of training three different neural nets on 6 models.
36 Convolutional-regression net. The feature maps in the first hidden layer are connected to all the variables in the input layer.
37 Convolutional-regression net. The first hidden layer consists of three sets of feature maps. Each set consists of three maps and is connected to one of the variables in the input layer.
38 True resistivity profile of model A at depth 615 feet.
39 Activation of first hidden layer per feature map for Test 21. These activations are for model A at depth 615 feet. This layer consists of 6 feature maps of 27 nodes each.
40 Intermediate results (2) of training different neural nets on 6 models.
41 Worst case neural net approximation of Deep-log. Average relative error is 14.2%.
42 Best case neural net approximation of Deep-log. Average relative error is 2.6%.
43 Worst case neural net approximation of Shallow-log. Average relative error is 8.5%.
44 Best case neural net approximation of Shallow-log. Average relative error is 1.2%.
45 Examples of earth models used for the net Invasion.
46 Worst case neural net approximation of Shallow-log. Average relative error is 7.5%.
47 Best case neural net approximation of Shallow-log. Average relative error is 4.5%.
48 A (realistic) earth model.
49 Neural net approximation of Deep-log. Average relative error is 7.6%.
50 Neural net approximation of Shallow-log. Average relative error is 8.3%.


1 Forward Modelling of Resistivity Logs

1.1 Resistivity Logging

A formation consists of several layers (beds) of rock, which contain pores. The rock pores can be filled with water or hydrocarbons (oil and gas). Rock and hydrocarbons do not conduct electric currents, whereas the formation water does. One reason for measuring the resistivity is to determine the hydrocarbon saturation within the rocks. The hydrocarbon saturation is an indication for the presence of oil. A simplified expression of this quantitative aspect is exemplified by Archie's equation

$S_w = \sqrt[n]{\frac{F R_w}{R_t}}$    (1)

Here $R_w$ is the resistivity of the water in the rock pores, F is the formation factor generally assumed to be derivable from a knowledge of the rock resistivity, $S_w$ is the water saturation (percentage of pore space occupied by water), $R_t$ is the measured rock resistivity and n is an empirically determined saturation exponent. Hydrocarbon saturation $S_h$ is equal to $1 - S_w$ (Moran 1985).
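As a quick numeric illustration of equation (1), the following sketch uses made-up values (ours, not from the thesis):

```python
# Worked illustration of Archie's equation (1); the numbers are ours.
F, Rw, Rt, n = 20.0, 0.1, 10.0, 2.0     # formation factor, water resistivity,
                                        # rock resistivity (Ohm m), exponent
Sw = (F * Rw / Rt) ** (1.0 / n)         # water saturation
Sh = 1.0 - Sw                           # hydrocarbon saturation
print(f"Sw = {Sw:.2f}, Sh = {Sh:.2f}")  # Sw = 0.45, Sh = 0.55
```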

1.1.1 Logging tools

Most of the physics behind the resistivity logging techniques can be found in (Moran 1985). In this section we will discuss the basic ideas behind these techniques.


[Figure 1: Laboratory resistivity measuring apparatus with unguarded planar electrodes. A voltage V is applied across electrodes A and B, each of area S, separated by a distance dr in meter.]

A sheet of material whose resistivity is to be determined is placed between and in contact with electrodes A and B, both of area S, as is shown in Figure 1. A voltage V is applied to the electrodes, resulting in a current I. If I is distributed uniformly over the area S and is zero outside, an application of Ohm's law for material thickness dr (m) and resistivity $\rho$ ($\Omega$m) yields

$V = \rho \frac{I}{S} \, dr$    (2)

The resistivity is found by measuring the voltage drop between the electrodes. In practice, the current I will not be uniformly distributed over the discs, but will tend to be maximum at the edges and minimum in the centre. To improve on this scheme, one splits the disc so that one has a small central disc of area $S_0$ carrying a current $I_0$, surrounded by the remainder of the disc held at the same potential V as $S_0$. This will result in nearly constant current density over $S_0$. The measured current $I_0$ will be "focused". The resistivity is now given more accurately than before by

$\rho = \frac{S_0}{dr} \frac{V}{I_0}$    (3)

[Figure 2: Modification of the laboratory apparatus using cylindrical electrodes instead of planar electrodes (left) and the Dual-Laterolog (right), with guard electrodes A1, A2 around the central electrode A0 and the currents I0, I1, I2.]

This idea is the basic ingredient of the Laterolog. The real Laterolog uses cylindrical electrodes instead of planar electrodes as indicated in Figure 2 (left). The Dual Laterolog consists of a Shallow (Pseudo) Laterolog and a Deep Laterolog. The Deep Laterolog has its current return electrode B remotely at the surface, which results in currents as shown in Figure 3. The Deep Laterolog reads far into the formation. The Shallow Laterolog has its current return electrodes placed above and below the electrode A, which makes the currents bend back to the tool as shown in Figure 3. This Laterolog reads the formation close to the tool, which makes it more sensitive to the invaded zone (see Section 1.1.2).

Both measurements are made simultaneously by using different frequencies; a relatively high frequency for the Shallow Laterolog and a very low frequency for the Deep Laterolog.

[Figure 3: Input to the Deep Laterolog (left) and the Shallow Laterolog (right), showing the part of the formation that is seen by each tool. The Deep Laterolog gets information from a larger part of the formation.]

1.1.2 Environmental effects

Several anomalies, like invasion, dipping beds, washed-out boreholes and very resistive formations, are encountered when measuring the resistivity of the formation. These anomalies have effects on the tool response, which are difficult to model. Situations in which these environmental effects occur are shown in Figure 4 and more detailed descriptions can be found in (Gianzero 1977), (Asquith 1982), (Chemali, Gianzero & Strickland 1983) and (Chemali, Gianzero & Su 1988).

Wells are drilled with rotary bits. Special drilling mud is used as circulating fluid. The mud removes cuttings from the wellbore, lubricates and cools the drill bit and helps maintain an excess of borehole pressure over formation pressure, which prevents blow-outs.



This pressure difference forces some of the drilling fluid to invade porous and permeable formation. In this process of invasion solid particles (clay minerals from the drilling mud) are trapped on the side of the borehole and form a so-called mudcake. The part of the formation which is invaded by mud filtrate is called the invaded zone.

[Figure 4: Environmental effects: (A) ideal situation, (B) dipping formation, (C) caved boreholes, (D) deviated boreholes and (E) horizontal boreholes, sketched with the tool and the current flow.]

The size of the borehole and the mudcake resistivity influence the measured resistivity. The smoothing of anomalies on the log by the borehole is quite effective in moderately saline to very saline mud, and less effective in fresh mud. The size of the borehole and the resistivity of the drilling fluid are taken fixed in our study.

In dipping formations the Dual Laterolog curves vary slowly across bed boundaries. The apparent bed thickness is increased in predictable proportions and the error due to shoulder bed effect (the influence of the beds adjacent to the current bed) is different from the non-dipping case.

Very resistive formations and formations with high resistivity contrasts between beds also affect the tool readings. Formations like limestone and dolomite can have resistivities over 2000 $\Omega$m, while most formations have a resistivity between 1 and 70 $\Omega$m.

In caved boreholes (wash-out) the Shallow Laterolog is sensitive to the effect of the increased hole diameter. Anomalous readings of the Deep Laterolog occur only at abrupt changes in hole diameter.

The tool response is essentially immune to a small eccentering of the tool. Only the Shallow Laterolog exhibits some sensitivity to eccentering in large boreholes.



The overall effect of an elliptical borehole is that it produces characteristic responses which lie between those obtained in two circular holes with diameters equal to the major and minor axes of the elliptical hole.

The shoulder bed correction required for the Deep Laterolog is much less important in conductive mud than it is in non-conductive mud. The Shallow Laterolog has much less shoulder bed effect than the Deep Laterolog, especially for bed thicknesses above ten feet.

Deviated boreholes and horizontal boreholes also give different tool readings. Compare the current flow in the ideal case (Figure 4) and these types of boreholes.

1.2 Inversion by Forward Modelling

1.2.1 Inversion

The found resistivity logs, Shallow- and Deep-log, have to be inverted to the true resistivity of the formation. This is done by an iterative forward modelling process, which is sketched in Figure 5. The actual field logs are first corrected with chartbook corrections (tornado charts for example; for borehole size, invasion, etc.). Then an initial guess for the formation model is made. This trial model includes a description of the borehole and formation geometry and "parameter values" (numbers assigned to variables such as borehole diameter and bedding dip, thickness and resistivity) (Anderson & Barber 1990). Then the tool physics is used to compute an expected log, which is compared with the actual field log. If the match is not good enough, the initial trial model is altered and the calculation repeated. This process is iterated until the two logs match satisfactorily.

The process consists of two steps: the function approximation from the guessed formation model to the expected Deep- and Shallow-log (so-called Forward Modelling) and a matching procedure. The latter procedure implies a minimalization of a matching error between the two logs. This can be done by hand or using sophisticated algorithms like the least-squares and maximum entropy methods (Anderson & Barber 1990).



[Figure 5: The inversion process and forward modelling of resistivity logs: an initial guess of the formation model is forward modelled to an expected log, which a matching procedure compares with the actual measured field log; the formation model is adapted until the logs match.]

1.2.2 Forward Modelling

The forward modelling part of the inversion is the most time-consuming part of the process. To compute the Laterolog response for a given electrode array and distribution of resistivities, it is sufficient to determine the associated transfer impedances. The transfer impedance $Z_{ij}$ is equal to $V_j / I_i$, where $V_j$ is the voltage measured at part j of the electrode configuration and $I_i$ is the current emitted by part i. A specific electrode configuration and its transfer impedances are shown in Figure 6.

[Figure 6: Transfer impedances: an electrode configuration with currents $I_0$, $I_1$, $I_2$ and voltages $V_0$, $V_1$, $V_2$, and the corresponding impedances $Z_{00}$, $Z_{01}$, $Z_{02}$, $Z_{11}$, $Z_{12}$, $Z_{22}$.]



For the computation of the response in simplified models (concentric cylindrical boundaries or plane boundaries) one can use analytical approaches, but with extended electrodes, for example, the problem can only be handled by numerical methods. In the case of dipping layers, the problem is of such complexity that no results have as yet been published (Moran 1985). The determination of the transfer impedances $Z_{ij}$ involves the solution of Laplace's equation in two space variables under certain boundary conditions.

There are a number of methods that can be used to solve these boundary value problems; we mention the Finite Element Method, the Boundary Element or finite-difference technique and a Hybrid Method (Gianzero, Lin & Su 1985).

The Finite Element Method is a numerical method based on an energy principle (Chemali et al. 1983). It can be shown that Laplace's equation is a direct consequence of minimizing the total energy of the system.

The Hybrid Method is a separation of variables approach where the radial (horizontal) dependence is treated numerically and the axial (vertical) dependence analytically. It combines the mode concept in wave-guide theory with the Finite Element Method. This method, employed by Gianzero, has been able to simulate a 100 foot log with 25 beds in less than twelve minutes on an IBM 3081, which is approximately 8 times faster than the Finite Element Method (Gianzero et al. 1985).

1.3 Neural Networks

In this section we will only describe the basic ingredients of neural networks. A good introduction can be found in (Haykin 1994).

A Neural Network consists of a number of layers that consist of nodes (neurons). A neuron receives input from the neurons it is connected to and can in its turn serve as input to other neurons. A connection between node i of a certain layer and a node j of another layer is characterized by a weight $w_{ji}$. The total input $u_j$ for node j is

$u_j = \sum_{i=1}^{n} w_{ji} x_i$    (4)

where $x_i$ is the activation of an input node i and the summation runs over all n nodes that node j receives input from.



[Figure 7: A diagram of neuron j: the input signals $x_1, \ldots, x_n$ are multiplied by the synaptic weights $w_{j1}, \ldots, w_{jn}$ and summed to $u_j$ by the summing function; the threshold $\theta_j$ is subtracted and the activation function $\varphi(\cdot)$ produces the output $y_j$.]

Now this node uses an activation function to determine its own activation and output

$y_j = \varphi(u_j - \theta_j)$    (5)

Here $\varphi$ is a certain activation function, which is usually a sigmoid function for the hidden nodes and a linear function for the output node in regression problems, and $\theta_j$ is a certain threshold for node j. Figure 7 shows a visualization of this process.
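As an illustration of equations (4) and (5), a minimal Python sketch of this forward pass (ours, not part of the thesis):

```python
import numpy as np

def neuron_output(x, w, theta, hidden=True):
    """Forward pass of one neuron (eqs. 4 and 5): u_j = sum_i w_ji x_i,
    then y_j = phi(u_j - theta_j), with a sigmoid phi for hidden nodes
    and a linear phi for the output node in regression problems."""
    u = np.dot(w, x)                                 # total input u_j (eq. 4)
    if hidden:
        return 1.0 / (1.0 + np.exp(-(u - theta)))    # sigmoid activation
    return u - theta                                 # linear output node

# Example: a neuron with three inputs
x = np.array([0.2, -0.5, 0.1])
w = np.array([0.4, 0.3, -0.8])
print(neuron_output(x, w, theta=0.1))
```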

This procedure of determining the input and the output of a node is done for all nodes, except the input nodes. The input nodes receive their activation directly from outside (from a file for example). In this case the input is some kind of representation of the formation model.

The time that is needed to determine the output of the neural net for a certain input can be expressed in the number of connections (weights) of the net. The operations done by the net are weight multiplications, summations and calculating the activation.

Determining the output of the net will not take much time. However, the process of making the net learn the problem, training, takes considerably more time. When using neural networks, the following aspects are important for the training time and the generalization ability. A net is said to generalize well if it produces (nearly) correct output for input that was not used during its training.

- The architecture of the net. Large nets (high number of weights) learn slowly and usually do not generalize well. This is due to the fact that the neural net "remembers" its training examples if it has too much freedom (too many weights).



- The training method. Several training methods for supervised learning exist, for example back-propagation, conjugate gradient, steepest descent, momentum descent. For some of these methods the tuning of certain parameters is very important and difficult. Supervised learning means that the net produces some output (the actual response) and corrects its behaviour according to the correct output (desired response). For multilayer feedforward networks (networks that have connections directed from the input to the output), the most widely used algorithm is the back-propagation algorithm. There are two phases in the BP-learning. In the first phase, the forward phase, the input signals propagate through the network layer by layer, eventually producing some response at the output of the network. The actual response so produced is compared to the desired response. The error signals generated by this comparison are then propagated backwards through the net in the second, the backward phase. More information on training methods can be found in (Haykin 1994).

- The training set. The training set should be representative for the problem, otherwise the net is not able to learn the problem or to generalize well. With conjugate gradient each training example is evaluated during training (batch training), so a large training set causes a long training time. More on the creation of the training set can be found in Section 1.3.3.

1.3.1 Network architecture

The network architecture is a description of the layers, the number of nodes per layer and the connections between the layers; it describes what the net looks like. For example a fully connected neural net is a net in which the nodes of one layer receive input from all nodes in the previous layer. This is the most common architecture, but it leads to a large number of weights. Another method is a locally connected network, which is described later when we turn to receptive fields and convolutional networks. The architecture determines the number of weights and has certain implications for the learning and generalization ability of the net.

1.3.2 The Learning Method

A net uses a learning method to minimalize the error in the net. The training set consists of P patterns $(x^p, d^p)$. Here, $x^p$ is the input pattern p and $d^p$ is the desired output for this input pattern. For a certain input $x^p$ the network calculates an output $a^p$. The error is given by the quadratic sum of the difference of this output and the desired output $d^p$:



$E = \sum_{p=1}^{P} (d^p - a^p)^2$    (6)

Here, P is the number of examples. The idea behind the learning methods is that the weights are adapted during training so that E is minimalized. This is done by a downhill technique, gradient descent. The weights are adapted in the direction with the steepest descent:

$\Delta w_{ji} = (\text{learning-rate parameter}) \cdot (\text{local gradient}) \cdot (\text{input signal of neuron } i \text{ to neuron } j)$    (7)

The weight correction depends on a learning parameter, the local gradient and the input signal of neuron i to neuron j. If E no longer decreases the training stops.

We have used the conjugate gradient (CG) method, which is an adaptation of the normal gradient descent method. It avoids the zigzag path followed by the gradient descent method, by incorporating an intricate relationship between the direction and the gradient vectors. The CG method is the most convenient algorithm, because it needs no tuning parameters and it is faster than normal back-propagation if the size of the training set is not too large. We have used the Xerion (version 3.1 and 4.0) simulator to train the networks (for more details on neural network simulators see Appendix A).
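To make the update rule concrete, here is a minimal batch gradient-descent sketch in Python for the error of equation (6) on a single linear output node (our illustration, not the thesis's Xerion setup):

```python
import numpy as np

def train(X, d, eta=0.01, epochs=1000):
    """Batch gradient descent minimizing E = sum_p (d^p - a^p)^2
    for a linear node a = X w; the weight step follows eq. (7)."""
    P, n = X.shape
    w = np.zeros(n)
    for _ in range(epochs):
        a = X @ w                      # actual responses a^p
        grad = -2.0 * X.T @ (d - a)    # dE/dw
        w -= eta * grad                # steepest-descent update
    return w

X = np.array([[0.0, 1.0], [1.0, 0.0], [1.0, 1.0]])
d = np.array([1.0, 2.0, 3.0])
print(train(X, d))                     # approaches w = [2, 1]
```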

1.3.3 The training set

The input has to be presented to the net, so the physical model must be transformed into a set of numbers that function as activations of the input layer. How this is done is discussed in the next chapter. Now we will focus on the scaling.

The input to the net must be scaled so the values lie more or less in the range [-1, 1]. As we have seen, the net calculates a weighted input. If two inputs have a very high ratio (for example $i_1 = 1$ and $i_2 = 1000$) the weights will in general be far apart as well. This will slow down the training, because a larger weight space has to be searched for an optimum set of weights. Another point of view is that large input values will have more influence on the activation of the nodes they are connected to. In this way we would already have built some prior knowledge into the net. To avoid this, we scale all the inputs to a small range. When certain inputs are important, the net can learn that itself. More on this subject of scaling can be found in Section 2.1.3.



A training set should be sufficiently large. Although there is no general prescription of how large a training set should be, there are some "rules", like the following from Baum and Haussler (Haykin 1994). A network will almost certainly provide generalization (see next section), provided that the following two conditions are met:

1. The fraction of errors made on the training set is less than $\epsilon/2$.

2. The number of examples, P, used in training is

$P \geq \frac{32W}{\epsilon} \ln\frac{32M}{\epsilon}$    (8)

Here, W is the total number of weights and M is the total number of hidden nodes. This formula provides a distribution-free, worst case estimation for the size of the training set for a single-layer neural network that is sufficient for a good generalization.

A training set should also be representative. This means that the examples in the training set are randomly generated and distributed over the whole input space.

1.3.4 Generalization

Although a neural net can learn any input-output mapping, its applicability is determined by its ability to predict outputs to inputs it has not seen during training, which is called generalization. Generalization is influenced by three factors:

- the size and representativeness of the training set,
- the architecture of the network,
- the physical complexity of the problem at hand.

With too few examples the net just memorizes the training set and exhibits poor generalization. If the number of examples is more than the number of weights, the net will generalize better. Widrow's rule of thumb (Haykin 1994) comes from equation 8 and states that in practice we need a training set size of approximately 10 times the number of weights when the error on the training set is 10%.

In our project, we will compare different architectures on one specific training set. The architecture that gives the best results, in terms of training error, generalization (testing) error and complexity (number of weights), will be trained on a larger training set.
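As a quick illustration of the two rules, the sketch below evaluates both bounds for a hypothetical net; W = 6750 matches the fully connected net mentioned in the abstract, while M = 15 hidden nodes and the 10% error are our assumptions:

```python
import math

W, M, eps = 6750, 15, 0.1                 # weights, hidden nodes, error fraction

baum_haussler = (32 * W / eps) * math.log(32 * M / eps)  # eq. (8), worst case
widrow = 10 * W                                          # Widrow's rule of thumb

print(f"Baum-Haussler worst case: P >= {baum_haussler:.3g}")
print(f"Widrow's rule of thumb:   P ~= {widrow}")
```

The gap between the two numbers illustrates why the distribution-free bound of equation (8) is mainly of theoretical interest, while the rule of thumb guides practice.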



1.3.5 Local minima

[Figure 8: Local minima in the error surface. Left: an error surface with a global minimum and several local minima. Right: with different weight initializations the net converges to different minima.]

In Figure 8 (left) the error surface, belonging to specific weight vectors, is shown. The learning algorithm that we use is basically a hill-descending technique, which may cause the algorithm to get trapped in a local minimum in the error surface, although we are interested in the global minimum. The algorithm gets trapped, because it cannot find a direction which makes the error smaller than the previous one. But somewhere else in the weight space there exists another set of synaptic weights for which the cost function is smaller than the local minimum in which the network is stuck. One method to avoid local minima is retraining the neural net with different weight initializations. This process is shown in Figure 8 (right); with different weight initialization the net will converge to different minima.

1.4 Forward Modelling using a Neural Network

The goal of this project is to obtain a good approximation of the Deep- and the Shallow-log. The formation is described by a number of beds. For each bed a number of radial zones are given and for each radial zone the resistivity ($\Omega$m) and its size (inch) is given. The first radial zone is the borehole with its radius and the resistivity of the drilling fluid, the next radial zone describes the invasion (if there is any) and the last radial zone describes the true resistivity. All this can be described with a model like shown in Figure 9 (the corresponding formation is shown on the right side). The tool response consists of two continuous logs (the Shallow-log and the Deep-log) like shown in Figure 10 and Table 1.

The model shown in Figure 9 contains 80 beds. The first bed has 3 radial zones and it starts at minus infinity (this is the same for all models, indicating an infinite shoulder bed). The first radial zone of this bed (the borehole) has a radius of 4.25 inch and the mud resistivity is 0.05 $\Omega$m. The second zone has a radius (dxo) of 45 inch and a resistivity (Rxo) of 1.90 $\Omega$m. And finally the third zone has a resistivity (Rt) of 27 $\Omega$m. The last radial zone has a radius of infinity (otherwise there would be another radial zone).


[Figure 9: A formation and its model. For each bed the model lists the start depth, the number of radial zones and, per radial zone, the resistivity and radius: the borehole (radius 4.25 inch, drilling fluid resistivity 0.05 $\Omega$m), the invasion zone (Rxo, dxo) and the virgin zone (true resistivity Rt).]

The neural net can be used as a fast forward model when the average relative error between the forward model output d and the neural network output a lies below 5%.

Table 1: The tool response.

    depth   LLd            LLs
    0.0     .110610E+02    .330318E+01
    0.2     .108760E+02    .325301E+01
    0.4     .106962E+02    .320733E+01
    0.6     .105092E+02    .316235E+01
    ...     ...            ...

The earth models in this project are created by assigning random numbers within a certain range for the parameters Rt, Rxo and dxo. The borehole radius and resistivity of the drilling fluid are fixed to 4.25 inch and 0.05 $\Omega$m respectively. The tool responses to these models are calculated with the forward model that is used at KSEPL.
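A sketch of how such random earth models could be generated (our illustration; the parameter ranges below are placeholders, not the thesis's exact sampling ranges):

```python
import numpy as np

rng = np.random.default_rng(0)

def random_model(n_beds=80):
    """Random earth model: per bed a random Rt, Rxo and dxo; the
    borehole radius and mud resistivity are kept fixed as in the text."""
    thick = rng.uniform(2.0, 20.0, n_beds - 1)         # bed thicknesses (ft)
    return {
        # first (shoulder) bed starts at minus infinity, second bed at 0.0
        "start_depth": np.concatenate(([-1e9, 0.0], np.cumsum(thick[:-1]))),
        "Rt":  rng.uniform(1.0, 70.0, n_beds),         # Ohm m (placeholder range)
        "Rxo": rng.uniform(0.5, 5.0, n_beds),          # Ohm m (placeholder range)
        "dxo": rng.uniform(10.0, 60.0, n_beds),        # inch (placeholder range)
        "borehole_radius": 4.25,                       # inch, fixed
        "mud_resistivity": 0.05,                       # Ohm m, fixed
    }

model = random_model()
print(model["Rt"][:3])
```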


[Figure 10: Part of a model with input signal Rt and output signals Deep-log (LLd) and Shallow-log (LLs), plotted in $\Omega$m (logarithmic scale, 10 to 100) against depth (5400 to 5500 feet). The input signals Rxo and dxo are omitted.]


2 Input representation and architecture design

As described in Section 1.3, the training time mainly depends on the size of the net and the size of the training set. To decrease the training time and increase the generalization capability, we like to keep the net as small as possible. This can be achieved by choosing a compact input representation and by decreasing the number of free variables (weights) in the net. The input representation has to be compact and appropriate to the problem. To facilitate the learning process, we employ two methods. Firstly we preprocess the input and secondly we force certain constraints on the network architecture. The purpose of the first method is to create an intermediate representation of the input which simplifies the problem for the neural network while making only a small computational overhead. The second method contributes to facilitating learning only if the resulting structure reflects the designer's a priori knowledge of the problem. Otherwise the network is a priori biased towards wrong solutions.

2.1 Input representation

We are looking for a compact and appropriate input representation. Given an earth model, described by n(2 + 2r) + 1 values (for n beds of r radial zones), the net should be able to produce the tool response at any depth.

[Figure 11: Example of sliding window input representation. A sliding window of size w is placed around the point of interest along the input models (tool response, true resistivity Rt, invasion resistivity Rxo, invasion diameter dxo); the window is centered around this point.]
The tool response at a specific depth is used as the target of the neural net (the target is also called the desired output). For one formation we have a tool response of m feet, sampled every t feet. This produces m/t + 1 targets per log. But what do we use as input to produce this target? We assume that a part of the formation, centered around a certain depth, is responsible for the tool response at that depth. This part of the formation is called a sliding window. The sliding window approach is shown in Figure 11.

The sliding window is described in two ways: by discretizing the formation model in the sliding window and by describing the beds that lie in the window. The first method is quite straightforward. It samples the part of the formation that lies in the window, without using knowledge about the input or relations between inputs. The second method looks more like the original model description and uses the fact that the formation is described by beds. In the model each bed is described by a number of values, which could be seen as attributes of that bed. In the attributes approach we describe the beds that lie in a window, centered around the point of interest, by a number of features.

2.1.1 Discretized sliding window

When there is invasion, see Section 1.1.2, the formation model contains three variables: Rt, Rxo and dxo. When there is no invasion the formation model contains only the variable Rt. We use a sliding window of fixed size w, which is placed along the input logs around the point of interest. The models are sampled within this window with a sampling period s, resulting in w/s + 1 inputs when there is no invasion and 3 times this number when there is invasion.

The following aspects should be taken into account in determining the size of the sliding window:

- The size of the tool. The currents flowing from the tool penetrate the formation. The currents of the Shallow Laterolog penetrate the formation and return to the top and bottom of the tool. The part of the formation that the tool receives information from, is at least as large as the tool itself. The tool is approximately 30 ft, which gives an indication that the window size should also be at least 30 ft.

- The type of target log (Deep or Shallow). As shown in Figure 3, the Deep Laterolog receives information from a larger part of the formation. The currents of the Shallow Laterolog return to the tool itself and do not penetrate the formation much. This indicates that the window size for the Deep-log should be larger than for the Shallow-log.


- The size of the beds. What happens if the window is smaller than the largest bed in the formation is shown in Figure 12. The sliding window is located more than once in the same bed, producing the same input, but possibly not the same target. Now the neural net has to learn f(x) = y1 and f(x) = y2, making the problem non-deterministic. Conflicting examples make it very difficult for the net to learn the problem, because it adapts its weights to reproduce two different targets for one input.

[Figure 12: When the sliding window is smaller than the largest bed in the model (input model Rt in $\Omega$m, with the second window shifted by some samples), the input to the net looks the same for the sketched situation, although the target differs a lot.]

We expect the net that is trained on the Shallow-log will perform better, because it needs a smaller window than the Deep-log.

The sampling period of the target log determines the number of examples we obtain per log. In real applications one takes two logging points per foot, but for efficiency reasons, we have taken five logging points per foot. The sampling period in the sliding window is important for the resolution of the bed boundaries. When we use a sampling period of s feet, we can describe a bed boundary with s feet resolution. When we want to have at least the same accuracy as the target log, we should use the same sampling period for the sliding window as for the target log. In this case that would mean using a sampling period of 0.2 feet.

This input representation is not very compact. When we use a sliding window of 25.4 feet and a sampling period of 0.2 feet, we have 384 inputs (128 per variable and 3 variables when there is invasion).
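The following Python sketch shows how such a discretized sliding-window input could be assembled for one logging point (our reconstruction with hypothetical helper names, not the thesis's code):

```python
import numpy as np

def window_input(depths, rt, rxo, dxo, center, w=25.4, s=0.2):
    """Sample the piecewise-constant earth model in a window of size w
    around `center` with period s: 128 samples per variable, 384 total."""
    offsets = np.arange(-w / 2, w / 2 + s, s)      # w/s + 1 = 128 offsets
    cols = []
    for signal in (rt, rxo, dxo):
        # take the value of the bed each sample point falls in
        idx = np.searchsorted(depths, center + offsets, side="right") - 1
        cols.append(signal[np.clip(idx, 0, len(signal) - 1)])
    return np.concatenate(cols)                    # 384 inputs

# depths: bed start depths; rt/rxo/dxo: per-bed parameter values
depths = np.array([-1e9, 0.0, 10.0, 25.0])
rt  = np.array([27.0, 12.0, 55.0, 30.0])
rxo = np.array([1.9, 0.9, 2.5, 1.2])
dxo = np.array([45.0, 19.0, 30.0, 25.0])
print(window_input(depths, rt, rxo, dxo, center=12.0).shape)   # (384,)
```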



2.1.2 Attributes

The input model describes a number of bed boundaries, each described by a number of radial zones. The neural net requires the number of inputs to be fixed for each sample. We use a fixed size window and describe the beds that occur within this window. If the window contains less than the fixed number of beds, we add "default" beds. These beds function as infinite shoulder beds. The order in which the beds are presented to the neural net is important. The location in the input of the bed that has most influence on the target signal for example (probably the bed in the center of the window) should be fixed (in our case it is presented first). Then the beds adjacent to this bed are presented and the beds adjacent to those beds and so on.

Each bed is described by a number of attributes (also called features). The contrast between two beds is defined as v1/v2 and the difference as v1 - v2 for values v1 and v2. The attributes we use are:

1. the true resistivity of the bed (Rt);
2. the invasion resistivity of the bed (Rxo);
3. the invasion radius of the bed (dxo);
4. the inverse distance to the logging point (points close to the bed boundary are considered to be more important than points that lie further away);
5. the contrast between the true resistivity of this bed and the bed that lies below this bed;
6. the contrast between the invasion resistivity of this bed and the bed that lies below this bed;
7. the contrast between the invasion radius of this bed and the bed that lies below this bed;
8. the difference between the true resistivity of this bed and the bed that lies below this bed;
9. the difference between the invasion resistivity of this bed and the bed that lies below this bed;
10. the difference between the invasion radius of this bed and the bed that lies below this bed.

The default beds have no contrast (1) and no difference (0) with their adjacent beds. The inverse distance to the logging point is 0 for the default beds (the beds continue to infinity). If we describe n beds in the chosen window, this results in 10n inputs.
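A minimal sketch of this 10-attribute description for a single bed (our names and example data, not the thesis's code):

```python
def bed_attributes(bed, below, logging_point_depth):
    """The 10 attributes of one bed relative to the bed below it."""
    dist = abs(bed["boundary_depth"] - logging_point_depth)
    return [
        bed["Rt"], bed["Rxo"], bed["dxo"],      # attributes 1-3
        1.0 / dist if dist > 0 else 0.0,        # 4: inverse distance
        bed["Rt"] / below["Rt"],                # 5-7: contrasts v1/v2
        bed["Rxo"] / below["Rxo"],
        bed["dxo"] / below["dxo"],
        bed["Rt"] - below["Rt"],                # 8-10: differences v1 - v2
        bed["Rxo"] - below["Rxo"],
        bed["dxo"] - below["dxo"],
    ]

bed   = {"Rt": 27.0, "Rxo": 1.9, "dxo": 45.0, "boundary_depth": 12.0}
below = {"Rt": 12.0, "Rxo": 0.9, "dxo": 19.0}
print(bed_attributes(bed, below, logging_point_depth=10.0))
```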



This approach is more compact than the discretized sliding window approach, but it is difficult to choose appropriate features that will facilitate the learning process. We assume the contrasts and differences are important and describe the problem well. If this is not the case, the learning will not be facilitated. It could even make it difficult for the neural net to learn the problem, when these features are not describing the problem well.

2.1.3 Input and output scaling

As discussed in the first chapter, the input should be approximately scaled to the domain [-1, +1]. For the true resistivity and the tool response we use a combination of logarithmic and normalization scaling and for the other variables a normalization scaling. The true resistivity (and the tool response) can range between 1 and 2000 $\Omega$m. To reduce this range, we use a logarithmic scaling. The ranges of the other variables, the invasion resistivity and the invasion radius, are much smaller and therefore we do not use the logarithmic scaling on these variables. The scalings take the following form

$x'_{\text{norm}} = \frac{x - \mu}{\beta \sigma}$    (9)

$x'_{\text{log}} = \ln(x)$    (10)

where x is the original input or output value, $\mu$ the mean and $\sigma$ the standard deviation of the variable x. The factor $\beta$ is applied to make the scaled range even smaller (in the experiments $\beta = 2$).

A normalized Gaussian distribution is shown in Figure 13. From this figure we find that for $\beta = 1$, 68.27% of the values (of this specific distribution) lie between -1 and +1. When we use $\beta = 2$ we find that 95.45% of the values lie between -1 and +1.

[Figure 13: Standard normal distribution, with 68.27% of the values within one standard deviation, 95.45% within two and 99.73% within three.]

In the attribute description approach only a normalized scaling method is applied.
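A small sketch of the two scalings, assuming equations (9) and (10) as reconstructed above, with the log-scaled variable subsequently normalized (matching the "combination of logarithmic and normalization scaling" described for Rt); the function names are ours:

```python
import numpy as np

def scale_norm(x, mu, sigma, beta=2.0):
    return (x - mu) / (beta * sigma)              # eq. (9)

def scale_log_norm(x, beta=2.0):
    y = np.log(x)                                 # eq. (10); Rt in [1, 2000] Ohm m
    return scale_norm(y, y.mean(), y.std(), beta)

rt = np.array([1.0, 27.0, 55.0, 2000.0])
print(scale_log_norm(rt))                         # roughly within [-1, 1]
```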



This combined scaling method of the tool response has an attractive consequence for the minimalization of the network error. For more details on the minimalization of the network error see Section 2.4.

2.2 Preprocessing

In this section we will describe the preprocessing methods we have used in the project. The purpose of preprocessing is to find an intermediate input representation that facilitates the learning process. Another advantage of preprocessing is that we can reduce the number of inputs. The preprocessing was only applied to the input that was created by the discretized sliding window approach.

We can view the input in two ways. Firstly as three N-dimensional vectors Rt, Rxo and dxo and secondly as three functions of the depth x: Rt(x), Rxo(x) and dxo(x). The latter is actually also a vector, because the functions are equally sampled within an interval [1, N]. In the first case we project the input to an M-dimensional subspace spanned by the principal components of the input. In the other case we use a set of orthogonal basis functions, the Haar wavelets, to project the input.

The disadvantage of input reduction is the loss of information. Hopefully, the information that is lost has a negligible influence on the learning and generalization of the net.

2.2.1 Principal Component Analysis

The following description of the principal component analysis is taken from (Hertz, Krogh & Palmer 1991). A common method from statistics for analyzing data is principal component analysis (PCA), also known as the Karhunen-Loeve transform in communication theory. The aim is to find a set of M orthogonal basis vectors (eigenvectors) that account for as much as possible of the data's variance. Projecting the data from their original N-dimensional space onto the M-dimensional subspace spanned by these vectors performs a dimensionality reduction that retains most of the intrinsic information in the data.

The first principal component is taken to be along the direction with the maximum variance. The second principal component is constrained to lie in the subspace perpendicular to the first. Within that subspace it is taken along the direction with the maximum variance. Then the third principal component is taken in the maximum variance direction in the subspace perpendicular to the first two, and so on. In general it can be shown that the kth principal component direction is along an eigenvector direction belonging to the kth largest eigenvalue of the covariance matrix. An example is shown in Figure 14.


[Figure 14: Most of the variation of the data lies in the direction of $\varepsilon_1$; the original basis vectors $e_1$, $e_2$, $e_3$ and the eigenvector directions $\varepsilon_1$, $\varepsilon_2$ are sketched.]

The covariance matrix is calculated for P patterns by

$\text{Covariance}(i,j) = \frac{1}{P} \sum_{p} (x_i^p - \bar{x}_i)(x_j^p - \bar{x}_j)$    (11)

Here, $x_i^p$ is input i of pattern p, $x_j^p$ is input j of pattern p and $\bar{x}_i$ and $\bar{x}_j$ are the means of input i and input j respectively. Then the matrix is diagonalized and the eigenvalues are calculated.

The original input vector x (Rt, Rxo or dxo) is written as

$x = x_1 e_1 + x_2 e_2 + \ldots + x_N e_N$    (12)

where $e_1, \ldots, e_N$ are the original basis vectors as shown in Figure 14. We can also write the input vector in another orthogonal set of basis vectors, the eigenvectors $\varepsilon_1, \ldots, \varepsilon_N$:

$x = (\varepsilon_1 \cdot x)\,\varepsilon_1 + (\varepsilon_2 \cdot x)\,\varepsilon_2 + \ldots + (\varepsilon_M \cdot x)\,\varepsilon_M + \ldots + (\varepsilon_N \cdot x)\,\varepsilon_N$    (13)

To reduce the size of the vectors from N to M, we project this vector to

$x' = (\varepsilon_1 \cdot x)\,\varepsilon_1 + (\varepsilon_2 \cdot x)\,\varepsilon_2 + \ldots + (\varepsilon_M \cdot x)\,\varepsilon_M$    (14)

Instead of using N values $x_1, x_2, \ldots, x_N$ from the original input vector we use M values $\varepsilon_1 \cdot x, \varepsilon_2 \cdot x, \ldots, \varepsilon_M \cdot x$ of the projected input vector as inputs for the neural net.



To calculate the percentage of information that is lost by this projection, we first have to write x' in the $e_i$ basis vectors:

$x' = a_1 e_1 + a_2 e_2 + \ldots + a_N e_N$    (15)

The values $a_i$ are calculated by

$a_i = \sum_{n=1}^{N} x_n \sum_{m=1}^{M} \varepsilon_{mn} \varepsilon_{mi}$    (16)

The information that is lost by this projection can be calculated (for P patterns in the input file) by

$\text{Loss of information} = \frac{1}{P} \sum_{p=1}^{P} \frac{\|x'^p - x^p\|}{\|x^p\|}$    (17)
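A compact NumPy sketch of this projection and the loss measure of equation (17) (our illustration on centred data; the thesis used its own tooling):

```python
import numpy as np

def pca_project(X, M):
    """X: P x N matrix of input patterns. Returns the M projection
    values per pattern and the mean relative loss of information."""
    Xc = X - X.mean(axis=0)                   # centre each input
    cov = (Xc.T @ Xc) / X.shape[0]            # covariance matrix, eq. (11)
    eigval, eigvec = np.linalg.eigh(cov)      # diagonalize
    eps = eigvec[:, ::-1][:, :M]              # N x M, largest eigenvalues first
    coeffs = Xc @ eps                         # (eps_m . x), the inputs of eq. (14)
    Xrec = coeffs @ eps.T                     # x' written back in the e_i basis
    loss = np.mean(np.linalg.norm(Xrec - Xc, axis=1)
                   / np.linalg.norm(Xc, axis=1))   # eq. (17)
    return coeffs, loss

X = np.random.rand(100, 64)
coeffs, loss = pca_project(X, M=8)
print(coeffs.shape, f"loss of information: {loss:.2%}")
```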

onalbasisfunctionsfi(x) Wecanwriteanyfunctionf(x)asaweightedcombination<strong>of</strong>otherorthog-<br />

f(x)isacontinuousfunction,weusethecoecientsci.Wechoosefunctions Instead<strong>of</strong><strong>using</strong>thevalues<strong>of</strong>f(x)(asinputs),whichmaybeinnitewhen f(x)=Xicifi(x) (18)<br />

fi(x)withproperties,thatmakeiteasiertomanipulatewiththosefunctions AcommonlyknownmethodistheFouriertransform,wheretheorthogonal thanwiththeoriginalfunctionf(x). basisfunctionsaresin(ax)andcos(ax).Thesebasisfunctionsallowyouto describethefunctionondierentfrequencylevels.<br />

anotherinterestingset<strong>of</strong>orthogonalbasisfunctions,calledwavelets.Weare OurinputsignalsRt(x),Rxo(x)anddxo(x)haveaveryspecialshape:they<br />

especiallyinterestedinthesimplestwavelets,theso-calledHaarwavelets. inthiscase,becausewewouldneedaninnitenumber<strong>of</strong>coecientsand basisfunctionstocorrectlymodelthediscretetransitions.Thereishowever areallrectangular\functions".TheFouriertransformisnotappropriate<br />

part<strong>of</strong>thewindow(varyinginsizeandlocation).Thepropertytheydescribeisthedierencebetweentheaveragevalueovertherstandseconestingaboutwaveletsisthattheinputcanbedescribedondierentdetail<br />

levels.Allcoecients,excepttherst,describeaspecicpropertyovera ThesewaveletsareblockfunctionsasshowninFigure16.Whatissointer-<br />

half<strong>of</strong>theirpart<strong>of</strong>thewindow.Therstcoecientdescribestheaverage overthewholewindow.



We will first give a short introduction on wavelets and how the coefficients are calculated (taken from (Strang 1989)), and then we will explain why the wavelets are so useful in our project. A wavelet is defined by

    W(x) = \sum_k (-1)^k c_{1-k} \phi(2x - k)    (19)

Here, k is taken symmetrically around zero. The scaling function \phi is defined by

    \phi(x) = \sum_k c_k \phi(2x - k)    (20)

under the conditions

    \int \phi \, dx = 1    (21)

    \sum_k c_k = 2    (22)

The Haar wavelet is the simplest wavelet. For this wavelet we choose c_0 = 1 and c_1 = 1. The scaling function for these coefficients is a block function defined by

    \phi(x) = 1 for 0 \le x \le 1, and 0 otherwise    (23)

The wavelet, as shown in Figure 15, is described by

    W_{Haar}(x) = \phi(2x) - \phi(2x - 1)    (24)

[Figure 15: The Haar wavelet W_{Haar}(x) = \phi(2x) - \phi(2x - 1), built from the box function \phi on the interval 0 to 1.]

We start with a vector (f_1, f_2, ..., f_N), which are N = 2^j equally sampled values of a function f(x) on a unit interval. This vector will be approximated by the sum of different weighted block functions. In Figure 16 the first 8 block functions, belonging to the levels 0, 1 and 2, are given. The projected input vector is described by the coefficients that correspond to the weighing of these block functions.



How are these coefficients calculated? On a vector x of 2^j values we perform two operations, L: R^{2^j} \to R^{2^{j-1}} and H: R^{2^j} \to R^{2^{j-1}}. The first operation, L(x), calculates the mean and the second, H(x), calculates the difference:

    L(x) = k (x_1 + x_2, x_3 + x_4, ..., x_{N-1} + x_N)^T,  H(x) = k (x_1 - x_2, x_3 - x_4, ..., x_{N-1} - x_N)^T    (25)

Here, usually k = 1/2 in decomposition and k = 1 in reconstruction, but we could also use k = \frac{1}{2}\sqrt{2} in both the decomposition and the reconstruction, which has the advantage of normalizing the wavelets at every scale (this is done in the experiments described later).

The vectors produced by L(x) and H(x) are both half the size of the original vector. The coefficients found by H(x) are the coefficients on the finest detail level, level j-1. To find the coefficients at the next detail level, we perform the operations L(x) and H(x) recursively on the vector produced by L(x) at the previous level. This continues until we reach level 0.

The projected input consists of the coefficients per detail level found by H(x) and the coefficient at level 0 found by L(x). So, if we name L_i the average operator at level i and H_i the difference operator at level i, the new input vector is constructed by (L_0, H_0, H_1, ..., H_{j-1}). On level i there are 2^i coefficients produced by H_i, the difference operator. On level 0 we have an extra coefficient coming from L_0. The total number of inputs is

    N = 1 + 2^0 + 2^1 + ... + 2^{j-1} = 2^j    (26)

which is equal to the size of the original input vector.

We will now present an example for a vector of length 2^3, x = (1, 3, 2, 2, 5, 3, 8, 0). In the first step, on level j = 2, we get for k = 1/2

    L(x) = (2, 2, 4, 4)^T and H(x) = (-1, 0, 1, 4)^T    (27)

We continue on level j = 1 with L(x) = x' as input vector:

    L(x') = (2, 4)^T and H(x') = (0, 0)^T    (28)



And finally on level j = 0 we find for L(x') = x''

    L(x'') = 3 and H(x'') = -1    (29)

The vector of wavelet coefficients, f'(x), is (L(x''), H(x''), H(x'), H(x)):

    f(x) = (1, 3, 2, 2, 5, 3, 8, 0)^T and f'(x) = (3, -1, 0, 0, -1, 0, 1, 4)^T    (30)

[Figure 16: The input signal is written as a weighted combination of rectangular block functions with coefficients c_0, ..., c_7. The coefficients c_i are used as inputs to the neural net.]
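The decomposition is easy to implement. The sketch below (illustrative only; numpy is assumed) uses k = 1/2 and reproduces the coefficient vector of eq. 30 for the example x = (1, 3, 2, 2, 5, 3, 8, 0):

    import numpy as np

    def haar_coefficients(f, k=0.5):
        """Recursive Haar decomposition of a vector of length 2^j into
        (L0, H0, H1, ..., H_{j-1}), following eqs. 25-26."""
        x = np.asarray(f, dtype=float)
        details = []                                 # H(x) per level, finest first
        while len(x) > 1:
            details.append(k * (x[0::2] - x[1::2]))  # differences: operator H
            x = k * (x[0::2] + x[1::2])              # means: operator L
        return np.concatenate([x] + details[::-1])   # (L0, H0, ..., H_{j-1})

    print(haar_coefficients([1, 3, 2, 2, 5, 3, 8, 0]))
    # [ 3. -1.  0.  0. -1.  0.  1.  4.]   as in eq. 30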



What is so interesting about the wavelet coefficients is that they describe a very local area of the input. We expect that the information in the center of the window is more important than the information on the edges. We know exactly which coefficients correspond to the edges and which to the center of the window, and we can remove (some of) the coefficients corresponding to the edges when we are not interested in this information.

Coefficient c_4 from Figure 16, for example, describes the transitions in the first 1/4 of the window (size w) with steps of size w/8. If we remove this coefficient, the transitions in the first quarter of the window can only be described by steps of w/4 by coefficient c_2. In this case we lose resolution on the first quarter of the window.

If we have an input coming from a sliding window of w = 25.4 feet, sampled every 0.2 feet, we have 128 input values and 7 detail levels, 0 to 6. On the finer detail levels (5 and 6) we remove a number of coefficients on the edges of the window. We lose resolution at the edges, but we retain the same accuracy as the original input in the center. In the center the transitions are described by steps of 0.2 feet and on the edges by steps of 0.8 feet (this is detail level 4).

In Figure 17 is shown what happens when a number of coefficients on the edges is removed. The transitions in the center are described very accurately, but on the edges we use larger steps than in the original input (a transition is then approximated by several steps instead of one transition).

[Figure 17: Boundaries outside the center of the sliding window are not described accurately: the approximation reproduces the true boundaries in the center but staircases them at the edges.]



The coefficients are removed from the edges of the window, so all the detailed information in the center is preserved. If, for example, we remove 20 coefficients on each side on level 6 and 10 coefficients on each side on level 5, we removed 7.8 feet on level 6 and also 7.8 feet at level 5. This means that at these parts of the window we have a resolution of 0.8 feet, and in the center 9.8 feet we have a resolution of 0.2 feet.

How much information is lost by this operation depends on the location of the bed boundaries and the sharpness of the transitions. A resistivity contrast is approximated by small steps instead of one transition. A hard measure for the loss of information is difficult to give. Whether the approximation affects the generalization performance in a negative way will show during training and testing.
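A sketch of this coefficient removal (illustrative; it assumes the coefficient ordering (L0, H0, H1, ..., H_{j-1}) introduced above, so detail level i occupies indices 2^i to 2^{i+1}-1, and uses the removal counts from the example in the text):

    import numpy as np

    def remove_edge_coefficients(coeffs, remove_per_side):
        """Drop wavelet coefficients at the window edges on the given
        detail levels; remove_per_side maps a level to the number of
        coefficients removed at each edge."""
        keep = np.ones(len(coeffs), dtype=bool)
        for level, n in remove_per_side.items():
            start = size = 2 ** level                    # level i starts at index 2^i
            keep[start:start + n] = False                # left edge of the window
            keep[start + size - n:start + size] = False  # right edge of the window
        return coeffs[keep]

    coeffs = np.arange(128.0)                  # 2^7 coefficients for one variable
    reduced = remove_edge_coefficients(coeffs, {6: 20, 5: 10})
    print(len(reduced))                        # 128 - 2*20 - 2*10 = 68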

2.3 Architecture constraints

The second method to reduce the complexity of the net, as we had already mentioned in the beginning of this section, is forcing certain constraints on the architecture. The advantage of this method is that, unlike with the preprocessing methods, no information is lost. We can also reflect our prior knowledge about the input in the design of the net. This facilitates the learning process and hopefully will improve the generalization of the net.

By using locally connected neurons (receptive fields), the net contains much fewer weights than the fully connected nets. We can reduce the number of weights even further by forcing some of the weights to be equal. This is investigated further in Section 2.3.4. In this section we will discuss the advantages and disadvantages of various fully and locally connected network architectures.

2.3.1 Fully connected nets

A common network architecture is a fully connected net as shown on the left side of Figure 18 (in this figure only the connections to the first hidden node are drawn). All nodes of one layer are connected to all nodes of the previous layer. The number of nodes of the hidden layer is very important. This number should not be too small, otherwise the net is not able to learn the problem. But it should also not be too large, otherwise the net has too much freedom and it will not generalize. More hidden layers can be added to improve on the training and generalization results. Usually one hidden layer is enough to learn a problem, but sometimes an extra layer helps to combine the features found by the first layer. It provides a more global view of the input.



[Figure 18: Fully connected (left) and locally connected (right) neural nets, with inputs Rt, Rxo and dxo.]

2.3.2 Locally connected nets

The input has a strong local structure, so it simplifies the problem to use so-called receptive fields. It may be easier for the neural net if a neuron only sees a part of the input and not all the inputs as in the fully connected nets. The neuron specializes on its part of the input and can be used as a local feature detector. We add an extra hidden layer in order to combine the local features properly. The part of the input a neuron is connected to is called a receptive field. Usually all receptive fields have the same size and are only shifted in space (or time) with a fixed step (fixed overlap). The weight kernel of the first receptive field, which is connected to the first neuron, is shown in Figure 19. We can constrain the weight kernels for the various receptive fields to be the same; this is called weight sharing and it is used in the convolutional networks. The advantage of weight sharing is the decrease in the number of weights and freedom (and therefore complexity of the neural net). The decrease of freedom might improve the generalization ability of the net, but if the freedom is reduced too much, the net's overall performance may decrease. The motivation for weight sharing is that we expect that a particular meaningful feature can occur at different times (or locations) in the input. An example of a locally connected net is shown in Figure 18 (right).

2.3.3 Symmetry constraints

In our models we have no dipping layers and no deviated boreholes. In this case the tool readings are assumed to be symmetric. This means that a signal and its mirror image, as shown in Figure 20, give the same tool response. This should also hold for the neural net. The response of the neural net (in a hidden node at the first hidden layer) to a signal x is f(x, w), for a certain weight vector w. The mirror image of x = (x_1, x_2, ..., x_N) is x' = (x_N, ..., x_2, x_1).



[Figure 19: Receptive fields which overlap 1/3: the first receptive field with weight kernel W_11, ..., W_33 on the n x m input, shown together with the first field shifted down 2/3 and shifted right 2/3.]

[Figure 20: A discretized input signal x_1 x_2 x_3 ... x_N and its mirror image x_N ... x_3 x_2 x_1.]

The response of the neural net to this input is f(x', w). We will now investigate what the implications are for the architectures of the nets when we force f(x, w) \equiv f(x', w), given the response of hidden node j

    f_j(x, w) = g\left( \sum_{i=1}^{N} w_{ij} x_i \right)    (31)

Here, j stands for hidden node j. Since the following equations should hold for any hidden node j in the first hidden layer, the index j is omitted. For the fully connected nets the following equation should hold:

    \forall x \in R^N: \sum_{i=1}^{N} w_i x_i = \sum_{i=1}^{N} w_i x_{N+1-i}    (32)



This is only true when w_i = w_{N+1-i}. This constraint can be built into the net by forcing the weights coming from input node i to be equal to the weights coming from input node N+1-i. These constraints are shown in Figure 21.

[Figure 21: Symmetry constraints on a fully connected net.]

The same constraints can be used for the locally connected net, as shown in Figure 22. In the locally connected nets the receptive fields are symmetrically constrained. This means that receptive field f is constrained to receptive field F-f+1, where F indicates the total number of receptive fields. The fields are also internally constrained symmetrically, as shown in Figure 22.

[Figure 22: Symmetry constraints on a locally connected net. The receptive fields are constrained symmetrically (receptive field f is constrained to field F-f+1 for F fields).]



For the nets that are trained on wavelet coefficients, we can also find some constraints. The wavelet vector of an input x = (x_1, x_2, ..., x_N) (N = 2^j) on a certain level is constructed by

    L(x) = k (x_1 + x_2, x_3 + x_4, ..., x_{N-1} + x_N)^T,  H(x) = k (x_1 - x_2, x_3 - x_4, ..., x_{N-1} - x_N)^T    (33)

The coefficients on this level come from H(x). The coefficients for the mirror image of x are calculated by

    L(x') = k (x_N + x_{N-1}, ..., x_4 + x_3, x_2 + x_1)^T,  H(x') = k (x_N - x_{N-1}, ..., x_4 - x_3, x_2 - x_1)^T    (34)

The coefficients on higher levels are calculated recursively on the vector constructed by L(x). On every level we find that L(x') is the mirror image of L(x). The response of the net to H(x) and the response of the net to H(x') should be the same on every detail level and for every possible input signal x. For the 2^j wavelet coefficients on level j > 0 the following equation should hold for any wavelet vector y = H(x) and y' = H(x'):

    \forall y \in R^{2^j}: \sum_{i=1}^{2^j} w_i y_i = \sum_{i=1}^{2^j} w_i y'_i    (35)

Here y'_i = -y_{2^j+1-i}: mirroring the input reverses the order of the coefficients within a detail level and flips their sign.



The coefficient on level 0, for example, describes the difference between the average over the first half of the window and the average over the second half of the window. The tool readings should be symmetric, so for the tool it does not matter whether this difference is positive or negative. On level j we find w_{2^j+i} = -w_{2^{j+1}-1-i} for i = 0, 1, ..., 2^j - 1. The resulting network is shown in Figure 23.

[Figure 23: Symmetry constraints on wavelet nets. The coefficients are constrained per detail level.]
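For the fully connected case the constraint w_i = w_{N+1-i} is easy to impose. The sketch below (illustrative; numpy assumed) ties each input weight to its mirrored partner by averaging, which is equivalent to giving both connections one shared parameter during training:

    import numpy as np

    def symmetrize_first_layer(W):
        """Enforce w_i = w_{N+1-i} on input-to-hidden weights of shape
        (hidden_nodes, N) by averaging mirrored weight pairs."""
        return 0.5 * (W + W[:, ::-1])

    W = np.random.randn(15, 128)
    Ws = symmetrize_first_layer(W)
    x = np.random.randn(128)
    print(np.allclose(Ws @ x, Ws @ x[::-1]))   # first-layer response is mirror-invariant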

2.3.4 Convolutional network

A neuron that is locally connected extracts local features. Sometimes we are not concerned about the location of these features, but only about the features themselves (for example in character recognition).

[Figure 24: Convolutional network: convolution and subsampling layers alternate between input and output. All neurons in a feature map share their weights.]

The set of weights w belonging to a receptive field makes it possible for a neuron in the first hidden layer to detect a specific feature at the location of the receptive field. We can use the same set of weights for all the receptive fields. This enables the first hidden layer to detect that specific feature, which was first only detected by the first hidden neuron, anywhere in the total input. The neurons in the first hidden layer that use this set of weights to detect a specific feature are called a feature map.

The hidden layer is now able to detect one feature. We can add more feature maps to make it possible to detect more features. Each feature map consists of a number of neurons that make use of the same set of weights (weight sharing). This set of weights, as shown in Figure 19, is used as a convolution kernel. The convolution, performed by one feature map, is defined by the sum

    y_j = \sum_{k}^{k+n-1} \sum_{l}^{l+m-1} w_{jkl} x_{kl}    (36)



where x_{kl} is the input pixel at location (k, l) and w_{jkl} is the weight between this input pixel and neuron j. The indices k and l indicate the left upper corner of the receptive field and n x m indicates the size of the receptive field.

The convolution layers are alternated with so-called subsampling layers. These subsampling layers are like the convolution layers: they also make use of receptive fields and shared weights. The overlap of the receptive fields is maximal (displacement of one pixel) at the convolution layers and not at all at the subsampling layers. (When the receptive fields do not overlap maximally, this is usually called subsampling.) In the subsampling layer the spatial resolution of the feature maps, generated by the convolution layers, is reduced. Due to this reduction in resolution, this layer provides some degree of translational and rotational invariance. This type of network is called a convolutional network and is shown in Figure 24.
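A direct translation of eq. 36 (an illustrative sketch, not the thesis implementation; numpy assumed) shows how one feature map with a shared n x m kernel produces a convolution for maximally overlapping receptive fields and a subsampling for non-overlapping ones:

    import numpy as np

    def feature_map(x, w, step=1):
        """Response of one feature map: the shared kernel w applied at
        every receptive field position (k, l), following eq. 36."""
        n, m = w.shape
        rows, cols = x.shape
        return np.array([[np.sum(w * x[r:r + n, c:c + m])
                          for c in range(0, cols - m + 1, step)]
                         for r in range(0, rows - n + 1, step)])

    x = np.random.rand(8, 8)
    w = np.random.rand(3, 3)
    print(feature_map(x, w).shape)          # (6, 6): maximal overlap (convolution)
    print(feature_map(x, w, step=3).shape)  # (2, 2): no overlap (subsampling)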

2.3.5 Time delayed network

Convolutional networks are used in two-dimensional problems, like character recognition. The one-dimensional version is called a time delayed network. The principles are more or less the same. A time delayed network is used in applications like speech recognition (Waibel, Hanazawa, Hinton, Shikano & Lang 1989).

The idea is that the response at a certain point of time (or depth) depends on previous inputs with a certain delay (hence the name time delayed). In this type of net there are no subsampling layers, only convolution layers. This is because we do not want the translational and rotational invariance that is provided by the subsampling layers. The loss of time resolution caused by the first layer is partially compensated by an increase in the number of features in the next convolution layer. This is called bi-pyramidal scaling (Guyon 1991). The architecture for a typical time delayed network is shown in Figure 25.

[Figure 25: Time delayed neural network: a number of feature maps (shared weights within each map) is applied to the features (or variables) over time through receptive fields.]



2.3.6 Convolutional-regression network

The design of the convolutional-regression network is based upon both the convolutional networks and the time delayed networks. The idea behind this type of network is that the network output depends on a number of inputs around the point of interest. So it depends on inputs above and below the logging point. For this net we use the input representation produced by the (uniform) discretized sliding window approach.

[Figure 26: Convolutional-regression network. In this example the feature maps are connected to only some of the features (or variables).]

The feature maps in the first hidden layer can be connected to all or only some of the features (variables) in the input layer. This is like the original convolutional networks, but in contrast with the time-delayed networks. In the latter the feature maps are always connected to all the features of the previous layer. All other layers of the convolutional-regression network are fully connected. So this net contains only one convolution layer (and this is different from both the convolutional and time-delayed architectures). We use only one convolution layer because we want to retain the spatial information of the input. The architecture is shown in Figure 26. In this example the feature maps are connected to only a number of the input features.

The receptive fields in the first layer do not overlap maximally. So actually this is not a convolution, but subsampling. The advantage of subsampling over convolution is that the number of hidden nodes (and the number of weights) that is needed is small (the number of hidden nodes is equal to the number of receptive fields). A disadvantage is the loss of spatial resolution.



A feature map from the first hidden layer detects a feature in the input. The second hidden layer determines the importance of the location of that feature. So the spatial information is preserved in the second (fully connected) hidden layer.
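A minimal forward pass of such a net is sketched below (illustrative only: one input variable, one feature map and no biases; the sizes are loosely modelled on the nets described later, and numpy is assumed). The single shared kernel is slid over the window with a step larger than one, so the first layer subsamples; fully connected layers follow:

    import numpy as np

    def conv_regression_forward(x, kernel, W2, w_out, step):
        """One feature map with shared weights (subsampling receptive
        fields), a fully connected hidden layer, and a linear output."""
        f = len(kernel)
        h1 = np.tanh(np.array([kernel @ x[i:i + f]        # shared weights
                               for i in range(0, len(x) - f + 1, step)]))
        h2 = np.tanh(W2 @ h1)                             # fully connected layer
        return w_out @ h2                                 # approximated log value

    rng = np.random.default_rng(0)
    x = rng.random(128)                  # one variable from the sliding window
    kernel = rng.standard_normal(16)     # shared weight kernel (receptive field)
    step = 8                             # fields overlap less than maximally
    n_fields = (len(x) - len(kernel)) // step + 1         # 15 receptive fields
    W2 = rng.standard_normal((15, n_fields))
    w_out = rng.standard_normal(15)
    print(conv_regression_forward(x, kernel, W2, w_out, step))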

2.4 Error function

A neural network can minimize any error function. The most common error function used is the sum square error, which is, for P patterns of the training set,

    E_{Net} = \sum_{p=1}^{P} (d_p - a_p)^2    (37)

Here, d_p is the desired output of pattern p and a_p is the actual output of pattern p.

In this project the performance of the neural net is measured in the relative error between the desired and the actual output, averaged over P patterns:

    E_{Rel} = \sqrt{ \frac{1}{P} \sum_{p=1}^{P} \left( \frac{d_p - a_p}{d_p} \right)^2 }    (38)

[Figure 27: The network error E_Net and the relative error E_Rel for fixed proportions d/a, plotted against a/d.]

The neural net error for one specific pattern p can be written as

    E_{Net}(p) = d_p^2 \left( 1 - \frac{a_p}{d_p} \right)^2    (39)



This means that for the same proportion a/d, a pattern p produces a higher error when the desired output is large. The net then emphasizes the learning of patterns with a high desired output. This is not what we want, because a small desired output is more likely to have a high relative error than a high one. Thus we preferred the error E_Rel as given in eq. 38, which overcomes this disadvantage.

Before presenting the patterns to the net, we have scaled the input and the output. The desired and actual outputs are scaled by

    t' = \ln(t)    (40)

The consequence of this scaling is that the network is minimizing the following error:

    E_{Net} = \sum_{p=1}^{P} \left( \ln(d_p) - \ln(a_p) \right)^2    (41)

          = \sum_{p=1}^{P} \ln^2 \frac{d_p}{a_p}    (42)

which is almost the same as minimizing the relative error, especially when the proportion d/a is almost equal to 1. Actually the net is minimizing the proportion d/a. For fixed proportions d/a the network error E_Net and the relative error E_Rel are drawn in Figure 27.

An overestimation (a = d + k) and an underestimation (a = d - k) should give the same error signal, as they do for the relative error in Figure 27. The neural net, with the error function as mentioned in eq. 41, favours a = d + k over a = d - k, because the former produces a lower error signal. It does, however, emphasize the learning of small target values over larger target values, which is also the case for the relative error.

It is very important to choose an appropriate error criterion. In this project we want to minimize the relative error. If we had used another scaling method (not logarithmic), the network error function would have had to be adapted for better performance. With this scaling, however, the sum square error function suffices.
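A small numerical check of this asymmetry (the values are chosen for illustration only):

    import numpy as np

    d, k = 10.0, 2.0
    for a in (d - k, d + k):
        e_rel = ((d - a) / d) ** 2            # per-pattern relative error, eq. 38
        e_log = (np.log(d) - np.log(a)) ** 2  # per-pattern log-scaled error, eq. 41
        print(f"a = {a:4.1f}: relative {e_rel:.4f}, log-scaled {e_log:.4f}")
    # a =  8.0: relative 0.0400, log-scaled 0.0498
    # a = 12.0: relative 0.0400, log-scaled 0.0332  (overestimation penalized less)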


3 Forward modelling without mud invasion

Forward modelling in the presence of mud invasion is very complex. The measured resistivity is heavily affected by the invasion resistivity (Rxo), although this depends on the invasion radius (dxo). Before investigating this complex situation, we first look at the case without mud invasion. Here we only have one variable (Rt). The borehole radius and the drilling fluid resistivity are fixed to 4.25 inch and 0.05 Ωm, respectively. With only one variable the input space of earth models is not too large, which allows us to first experiment with the input representation and the network architecture. When we have found the most appropriate representation and architecture, we can use this in the case with invasion.

We have taken small bed sizes that range between 1 and 5 feet. Each earth model consists of a small log of 50 feet long and 15 beds, with a total of 47 models (2350 feet) for training and 33 models (1650 feet) for testing. The true resistivity ranges between 1 and 70 Ωm. These parameters are summarized in Table 2.

For the experimentation with the input representation and network architecture we have used a small training set of 4 earth models and an equally sized test set of 4 models. For these small tests no absolute errors are included, since they are only used for comparison. The performance is measured in the average relative error of the training set and the test set. A net performs well when both errors are small and comparable.

Table 2: Parameters for earth models without invasion.

  Fixed parameters:
    Borehole radius             4.25 inch
    Drilling fluid resistivity  0.05 Ωm
  Variables:
    Bed size          1, 2, 3, 4 or 5 feet
    True resistivity  1, 2, 3, ..., 70 Ωm
  Data:
    Training set  47 models
    Test set      33 models
    Per model     50 feet, 15 beds, 251 examples




3.1 Experimenting with different scaling methods

The input and output are approximately scaled to the range [-1, +1] (see Section 2.1.3). This is done by normalization of the input and output:

    z' = \frac{z - \mu}{\sigma}    (43)

where \mu is the mean and \sigma is the standard deviation of the variable z on the training set. This type of scaling can also be done in combination with a logarithmic scaling. In that case the mean and standard deviation of ln(z) are used. The means and standard deviations for the input and output are given in the following table:

  Variable                            Mean    Standard deviation
  Rt                                  29.00   16.67
  logarithmically scaled Rt           3.00    0.83
  LLd or LLs                          30.00   16.67
  logarithmically scaled LLd or LLs   3.30    0.80

We tested a fully connected net of 25 inputs (sliding window of 5 feet) and one hidden layer of 5 nodes on 1004 examples (4 models) and found that the logarithmic scaling in combination with the normalization gave the best results. The worst results are found when no logarithmic scaling was applied, so the normalization on itself is not sufficient. When no logarithmic scaling is applied, the neural net is minimizing the absolute error between the desired and actual output. We measure the performance in the relative error, and a minimization of the absolute error does not necessarily mean a minimization of the relative error. The logarithmic scaling overcomes this problem. Due to this scaling the proportion is minimized (see Section 2.4), which is almost the same as minimizing the relative error. We will use this combined scaling method in the other tests.
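The combined scaling is summarized by the following sketch (illustrative; numpy assumed), which fits the statistics of ln(z) on the training set, as in eq. 43 applied to ln(z):

    import numpy as np

    def fit_log_norm(z_train):
        """Return scale/unscale functions for combined logarithmic
        scaling and normalization: z' = (ln z - mu) / sigma."""
        log_z = np.log(z_train)
        mu, sigma = log_z.mean(), log_z.std()
        scale = lambda z: (np.log(z) - mu) / sigma
        unscale = lambda z: np.exp(z * sigma + mu)
        return scale, unscale

    rt = np.random.uniform(1.0, 70.0, 1000)       # true resistivities in Ωm
    scale, unscale = fit_log_norm(rt)
    z = scale(rt)
    print(round(z.mean(), 6), round(z.std(), 6))  # approximately 0 and 1
    print(np.allclose(unscale(z), rt))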

3.2 Experimenting with the network architecture

Now we started experimenting with the network architecture. The following experiments are done for a window size of 5 feet. The tests are done for both the Deep- and the Shallow-log with 4 training models (1004 examples).

- Variation of the number of hidden nodes (5, 10, 15, 20, 25). We found that increasing the number of nodes up to 15 improves the results, but nets with 15 nodes and more gave similar results.
- Variation of the number of hidden layers (1 or 2). Two hidden layers of 15 nodes each or one hidden layer of 15 nodes gave comparable results.



- Adding connections from the input layer directly to the output layer (25 extra connections). Again the results with these extra connections and with the normal fully connected net were comparable.

When the results are comparable we choose the net with the fewest weights. These nets most likely have the best generalization and the smallest training time. The best architecture found by these experiments is a normal fully connected net with one hidden layer of 15 nodes.

3.3 Experimenting with the size of the sliding window

The scaling method and the network architecture are fixed; only the window size is varied in the following tests. We tried window sizes of 5, 10, 15 and 30 feet. The smallest window size we can try is 5 feet (see Section 2.1.1). Again the nets are trained on 4 models (1004 examples), on both the Deep- and the Shallow-log. In both cases a window size of 15 feet gives the best results. Although the results are slightly better for a window size of 30 feet, this does not outweigh the longer training time due to the higher number of weights (2281 versus 1156).

The Shallow-log approximation is better than the Deep-log approximation for the same window size. We already expected this, because the Deep Laterolog "sees" a larger part of the formation and therefore needs a larger window size. This was already shown in Figure 3 in Section 1.1.1.

3.4 Summary and results

The architecture we found by the previous tests is a fully connected neural net with one hidden layer of 15 nodes and a window size of 15 feet (75 inputs). This net is trained on 47 models with a total length of 2350 feet and tested on 33 models with a total length of 1650 feet.

Table 3: Average relative error for earth models without invasion (1).

                Training set   Test set
  Shallow-log   1.7%           2.2%
  Deep-log      5.1%           6.0%



As shown in Table 3, the average relative error is below 5% for the approximation of the Shallow-log. The generalization ability of the net that is trained on the Shallow-log is quite good (the test error and the training error are comparable).

However, the average relative error may not be an appropriate error criterion, since it is possible that a few points with a high relative error have a relatively large contribution to the average relative error. Therefore, we also calculate the percentage of the output that has a relative error below 5% (called "correct"), as shown in Table 4.

Table 4: Performance on earth models without invasion (1).

                Training set   Test set
  Shallow-log   98% correct    96% correct
  Deep-log      62% correct    61% correct

From this table we find that only 4% of the approximation of the Shallow-log has a relative error above 5%. And this is very close to the error on the training set. Therefore, we can say the learning and generalization abilities of the net are quite good.

The approximation of the Deep-log, however, causes more problems. This is shown in Figure 28. The Deep-log differs substantially from the true resistivity profile, in contrast to the Shallow-log. This is caused by the effect of the shoulder beds, which is less profound in the case of the Shallow-log (see Section 1.1.2). The output the neural net gives for this model is shown in Figure 29. In this figure one sees that the output given by the net looks more like the resistivity profile (like the Shallow-log) and therefore has quite a large relative error with respect to its target value (the Deep-log). An improvement could be made by using more models similar to this one in the training set.

We conclude that our input representation (sliding window and scaling) is appropriate for the problem. A fully connected neural net with one hidden layer is sufficient to learn the input-output mapping and to provide good generalization over data it has not seen before. In the following chapter we will abandon the fully connected nets and look for more complex architectures to improve on the results and to handle invasion.



[Figure 28: Model that causes difficulties in approximating the Deep-log. True resistivity, Deep-log and Shallow-log (0 to 80 Ωm) versus depth (0 to 50 feet).]

[Figure 29: Response of the neural net to the difficult model shown above. True resistivity, neural net output and Deep-log (0 to 80 Ωm) versus depth (0 to 50 feet).]



4 Forward modelling with mud invasion

Now we have found that it is possible to learn the mapping between the earth models and the tool response, we are going to look at a more difficult (and interesting) problem. In the new earth models the layers of the formation are invaded by the drilling fluid. This means we also have to take Rxo, the invasion resistivity, and dxo, the invasion radius, into account. Again the borehole radius and the resistivity of the drilling fluid are fixed. The input space of earth models, constructed from randomly chosen combinations of the three variables Rt, Rxo and dxo, is much larger than in the previous case, when there was only one variable (Rt). We expect the network needs more examples to learn the problem.

We first perform a number of tests on a small training set in order to get an idea about how well the chosen input representation and architecture perform on an (also small) test set.

Table 5: Parameters for earth models with invasion.

  Fixed parameters:
    Borehole radius             4.25 inch
    Drilling fluid resistivity  0.05 Ωm
  Variables:
    Bed size              1, 2, 3, ..., 20 feet
    True resistivity      2, 3, 4, ..., 71 Ωm
    Invasion resistivity  0.5, 0.7, ..., 2.5 Ωm
    Invasion radius       8, 9, ..., 50 inch
  Data:
    Training set    models A, B, C, D, E, F (and M, N, O, P, Q, R, S)
    Validation set  models G, H and I
    Test set        models J, K and L
    Per model       1000 feet, 80 beds, 5000 examples

We use bed sizes between 1 and 20 feet, which is more realistic than in the previous tests, where we used beds between 1 and 5 feet. Each earth model is approximately 1000 feet long and consists of 80 beds. There is a total of 19 models for training, testing and validation, as shown in Table 5. During training one calculates the error on a set of examples that is not used during training, the so-called validation set. When the error on this set starts to increase, the training is stopped. In the small tests we have not used a validation set.



The training was stopped when the network converged to a minimum. This means the nets are overtrained: the error on the training set is minimal, but the error on the test set is not. In these experiments we only use the Shallow-log. We have found in the previous tests, without invasion, that the mapping between the earth models and the Shallow-log was easier to learn than the mapping between the earth models and the Deep-log. If a net finds it difficult to learn the Shallow-log, we expect it would have even more difficulty in learning the Deep-log.

In the next sections some of the results are given in duplicate in order to get a better comparison between the various tests. We have used one model (A) of 4601 sample points as training set and one model (F) of 4876 sample points as test (generalization) set.

4.1 Scaling of the parameters

The true resistivity Rt and the Shallow- and Deep-log (LLs and LLd) are scaled as in the previous tests. The scaling methods we have used here are summarized in the following table:

  Variable   Domain                   Scaling
  Rt         2, 3, ..., 71 Ωm         log + norm
  Rxo        0.5, 0.7, ..., 2.3 Ωm    norm
  dxo        8, 9, ..., 50 inch       norm
  LLd                                 log + norm
  LLs                                 log + norm

The means and standard deviations used in the normalization are computed on the training set.

In the following sections we use tables to describe the performed tests. The following table entries are used to describe a test:

- Architecture: A description of the net per layer, separated by a ".". For each layer the number of nodes is given. This number can be split into the number of values per variable or feature map. A layer of 3x128, for example, means we have 128 values per variable. A layer of 6x27 means we have 6 feature maps of 27 nodes each.
- Connections: The type of connections. This is fully or locally connected (fully connected by default).
- Number of weights: The number of weights in the net.
- Window size: The used size of the sliding window, given in feet (default value is 25.4 feet).
- Sampling period: The used sampling period, given in feet (default value is 0.2 feet).



- Sampling: The type of sampling. This can be uniform (default value), non-uniform or none.
- Number of epochs: The number of epochs that was needed to train the net.
- Training error: The average relative error on model A.
- Generalization error: The average relative error on model F.

4.2 Experimenting with different input representations

In this more complex situation we have to look at three variables per sampling point. This results in a high number of inputs for the neural net. Because of the results in the experiments without invasion, we first looked at the input representation coming from a uniform sampled sliding window in Test 1. A window size of 25.4 feet and a sampling period of 0.2 feet gives us 3x128 inputs, which is quite high. But we also looked at an input description by attributes in Test 2, as described in Section 2.1.2. The results are given in Table 6.

The generalization performance of the attributes input representation is very bad. The net is heavily overtrained, since the error on the test set was only 41.8% after 183 epochs (the training error at that time was 11.4%). Since the sliding window representation is giving better results than the attributes description, we will now look for methods to reduce the complexity of the net. This can be done by reducing the number of inputs or by reducing the number of weights.

[Figure 30: Small difference in input with the attributes input representation. Two sampling points 0.2 feet apart have identical attributes (true resistivity, invasion resistivity and invasion radius; contrasts and differences with the adjacent bed); only the inverse distance to the bed boundary differs (10 at the first sampling point, 5 at the second).]



Table 6: Comparing the discretized sliding window against the attributes representation.

                          Test 1                       Test 2
  architecture            3x128.10.1                   100.15.15.1
  number of weights       3861                         1771
  window size             25.4 feet                    30 feet (10 beds)
  sampling period         0.2 feet                     -
  input description       Discretized sliding window   Attributes
  sampling                Uniform                      -
  number of epochs        3060                         3484
  training error          3.1%                         4.2%
  generalization error    70.0%                        >100%

Table 7: Comparing uniform sampling methods with different sampling periods.

                          Test 1        Test 4
  architecture            3x128.10.1    3x100.10.1
  number of weights       3861          3021
  window size             25.4 feet     29.7 feet
  sampling period         0.2 feet      0.3 feet
  number of epochs        3060          3634
  training error          3.1%          2.9%
  generalization error    70.0%         67.3%

Table 8: Comparing uniform and non-uniform sampling methods.

                          Test 3          Test 5
  architecture            3x128.10.10.1   3x64.15.15.1
  number of weights       3971            3151
  window size             25.4 feet       29.4 feet
  sampling period         0.2 feet        -
  sampling                Uniform         Non-uniform
  number of epochs        6429            4404
  training error          1.1%            1.0%
  generalization error    65.2%           46.4%



A possible cause of the bad performance on the attributes is that the net is fed with a number of almost similar examples. Only the inverse distance for each bed differs in those cases, as shown in Figure 30. The inverse distance becomes the most important value in the input. This can also be seen from the weights. Most weights are quite small, ranging from -0.5 to 0.5, but the weights belonging to the connections from the inverse-distance inputs to the hidden layer are large (about 1.5).

It is important to choose appropriate attributes, but finding these attributes is very difficult. The network is actually pushed in some kind of direction by using attributes. This can make the learning easier, but it can just as easily make the learning more difficult.

4.3 Experimenting with input reduction methods

The number of inputs coming from a uniform sampled sliding window is very high. For example, a window of 29.8 feet and a sampling period of 0.2 feet produces 3x150 inputs. We would like to use a hidden layer of 15 nodes and maybe even two hidden layers of 15 nodes. The former gives us 6781 weights and the latter 7021 weights. To train such a large net we need approximately 70000 examples (Widrow's rule of thumb: take about 10 times the number of free variables). Each example consists of 451 values (3x150 inputs and one target), so we would need to store approximately 31,570,000 values to train this net. This large set will slow down the training and causes storage and memory problems.

It is for this reason that we looked at several ways to reduce the number of inputs for a fixed window size. The disadvantage of input reduction is the fact that we lose information. The number of inputs depends on the used window size and the used sampling period. We could use a smaller sliding window than 25.4 feet, but this is not an attractive option, because we suspect the tool readings are affected by a window of at least the size of the tool (see Section 2.1.1).

4.3.1 Using different sampling methods

Another option is to use a coarser sampling period, for example 0.3 feet (Test 4) instead of 0.2 feet. In a sliding window of 29.7 feet this results in 3x100 inputs, instead of the 3x150 inputs with a 0.2 feet sampling period in a sliding window of 29.8 feet.

Another approach is to use a non-uniform sampling method. This approach is based on the fact that the physical tool receives most of its information from the center and less from the sides. We have chosen a heuristic non-uniform sampling method in Test 5, as shown in Figure 31, which results in only 3x64 inputs per window (instead of 3x150). In Test 3 we use the same net as in Test 1, but with two hidden layers.



The results for these tests are shown in Table 7 and Table 8.

[Figure 31: Non-uniform sampling in a sliding window of 29.4 feet: the central 3.0 feet are sampled every 0.2 feet, the adjacent 6.0 feet on each side every 0.4 feet, and the outer 7.2 feet on each side uniformly every 0.8 feet.]

When a uniform sampling method is used, it is possible to use a coarser sampling period (0.3 feet instead of 0.2 feet), because the training and testing results are similar. The bed boundaries are described less accurately when a coarser sampling period is used, but from these tests we find that this does not influence the performance in a negative way.

With the non-uniform sampling method we only lose accuracy at the edges, because the center of the window is sampled with the same sampling period as in the original input. The net that is trained on this input representation performs better than the net that was trained on the original input.

Both methods, coarser and non-uniform sampling, are appropriate as methods to reduce the inputs.
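The sampling positions of Figure 31 can be generated as below (a sketch; the exact per-segment counts are an assumption read off the figure, chosen so that the window spans 29.4 feet and yields the 64 samples per variable used in Test 5):

    import numpy as np

    def non_uniform_positions():
        """Sample positions (feet, from the left window edge): outer
        7.2 feet at 0.8 feet, middle 6.0 feet at 0.4 feet, central
        3.0 feet at 0.2 feet, mirrored on the right-hand side."""
        steps = [0.8] * 9 + [0.4] * 15 + [0.2] * 15 + [0.4] * 15 + [0.8] * 9
        return np.concatenate([[0.0], np.cumsum(steps)])

    pos = non_uniform_positions()
    print(len(pos), pos[-1])   # 64 samples spanning 29.4 feet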

4.3.2 Reducing the input by projection to principal components

We use the principal component analysis as described in Section 2.2.1. The eigenvectors are calculated for an input file consisting of six models. The models are not alike internally, so it is better to use a set of models. Otherwise the eigenvectors will be appropriate for one model, but not for another. We found that it does not matter whether you use six models or more, so we have chosen the models A, M, N, P, Q and R (these models are very dissimilar).

Both vectors (the original N-dimensional and the projected M-dimensional) are rescaled with the inverse normalization scaling of equation 9, before the LOI is calculated. Due to this scaling, the loss of information for M = 0 is not 100%. The scaled vector consists of only zeros, but the rescaled vector is everywhere equal to the mean of the input values. This rescaling is not absolutely necessary, but we wanted to calculate the loss of information on the original input.

In Test 7 the original input vector consists of 3x64 inputs, constructed by a non-uniform sampled sliding window as in Test 5.



[Figure 32: Loss of information (in percent) per variable (Rt, Rxo, dxo) for Test 6, as a function of the number of principal components used (M). The original input consists of 3x128 inputs, coming from a uniform sampled sliding window.]

Table 9: Comparing uniform sampled input without and with projection to principal components.

                          Test 3          Test 6
  architecture            3x128.10.10.1   3x32.10.10.1
  number of weights       3971            1091
  PCA                     No              Yes
  LOI                     -               7.4% on Rt, 4.7% on Rxo, 4.4% on dxo
  number of epochs        6429            5193
  training error          1.1%            1.7%
  generalization error    65.2%           52.3%



[Figure 33: Loss of information (in percent) per variable (Rt, Rxo, dxo) for Test 7, as a function of the number of principal components used (M). The original input consists of 3x64 inputs, coming from a non-uniform sampled sliding window.]

Table 10: Comparing non-uniform sampled input without and with projection to principal components.

                          Test 5         Test 7
  architecture            3x64.15.15.1   3x32.15.15.1
  number of weights       3151           1711
  window size             29.4 feet      29.4 feet
  sampling                Non-uniform    Non-uniform
  PCA                     No             Yes
  LOI                     -              7.2% on Rt, 4.2% on Rxo, 3.9% on dxo
  number of epochs        4404           3896
  training error          1.0%           1.4%
  generalization error    46.4%          45.6%



The LOI for various values of M is shown in Figure 33. The LOI, for both tests, is calculated for P = 1520 patterns coming from models J, O and S. In Table 9 and Table 10 the LOI is given per variable for the training and test set.

In both tests we find that the training and generalization errors for the tests with and without PCA are comparable. This means the principal component analysis is a good method to reduce the number of inputs. In the first test we achieve an input reduction of 75% and in the second test 50%. The weights are reduced by 73% and 46% respectively.

4.3.3 Reducing the inputs by removing wavelet coefficients

In the following tests we use wavelet coefficients as inputs instead of the original inputs. We do not know how many coefficients can be removed on the finer detail levels without losing too much information. Therefore, we run several tests with different reductions. In the first test, Test 8, no coefficients are removed at all, and in the last test, Test 11, we remove all the coefficients on the finest detail level and some on higher detail levels. The results are given in Table 11 and Table 12.

The first interesting aspect is that the net learns better when coefficients are removed. During training the weight correction for a hidden node is calculated as indicated by equation 7. It depends on the learning parameter, the local gradient and, most importantly, on the input signal of node j. So when input node j is 0, there is no weight correction. This is probably the cause of the bad performance of the net in Test 8.

The wavelet coefficients are 0 when there is no bed transition at that spot, because the difference between the first half and the second half of (a part of) the window is 0. Most of the coefficients on higher detail levels are zero, because there are only a few bed boundaries in the window. Sometimes a bed boundary is not even detected, as shown in Figure 34.

On the finest detail level this problem is quite severe, because these coefficients only see a very small part of the input, and this small part is not likely to contain a bed boundary. Whenever the bed boundary is not detected, or when there is no bed boundary at all, the coefficients are zero and the weight from such a coefficient to the hidden node is not updated. In other words, the hidden node does not "learn" from this coefficient.



Table 11: Comparing input representations with different numbers of wavelet coefficients (1).

                          Test 8       Test 9
  architecture            3x128.10.1   3x68.15.1
  number of weights       3861         3091
  coefficient reduction   none         2x20 on level 6, 2x10 on level 5
  number of epochs        2125         4303
  training error          4.1%         0.8%
  generalization error    63.5%        51.5%

Table 12: Comparing input representations with different numbers of wavelet coefficients (2).

                          Test 10             Test 11
  architecture            3x48.10.1           3x34.15.1
  number of weights       2191                1561
  coefficient reduction   2x25 on level 6,    2x32 on level 6,
                          2x12 on level 5,    2x12 on level 5,
                          2x3 on level 4      2x3 on level 4
  number of epochs        4479                3848
  training error          1.0%                0.9%
  generalization error    50.3%               56.4%



[Figure 34: Not all transitions are detected by wavelet coefficients. For a part of the input signal (discretized input values 1 to 8) with a step from a to b inside the sliding window, the wavelet coefficients on this level are (0, (b-a)/2, 0, 0); after the sliding window is moved 0.2 feet to the right, the coefficients on this level are (0, 0, 0, 0).]

There are approximately 2 bed boundaries per pattern. Not all boundaries are detected by the coefficients on the finest detail level, but when a boundary is not detected by coefficient c_i, it will be detected by coefficient c_{i-1} after moving the sliding window. So on average 50% of the boundaries are detected on the finest detail level. The training set consists of 4601 patterns, so this means the weight coming from coefficient c_i to hidden node h_j is updated approximately 72 times (4601 patterns, 2 boundaries per pattern of which 50% are detected, and 64 coefficients). A usual training takes about 2000 to 3000 updates per weight.

In Tests 9, 10 and 11 we remove a number of the coefficients on the finest detail level and also from higher levels, and we see that the performance of these nets is much better than the performance of the net in Test 8. When we compare the results to the net trained in Test 1, we see that the nets have better training and generalization errors, although the number of inputs is reduced from 3x128 to 3x68, 3x48 and 3x34. An input reduction of 73% is possible without reducing the net performance. One explanation for the better generalization performance is that the nets contain much fewer weights; the number of training examples in proportion to the number of weights is higher in these tests than in Test 1. But it could also be that the net finds that this input representation facilitates the training and generalization more than the original discretized sliding window. The improvement in generalization is better than for the principal component analysis of Tests 6 and 7.



4.4 Creating a more representative training set

Our training set is not very representative, because the examples of adjacent logging points look very similar. This is caused by the fact that we use a dense sampling period for the target log. We use this dense sampling period because we want as many examples per log as possible (the data set is limited). There are two methods we employ to increase the representativeness of the training set: firstly by using a coarser sampling period for the target log, and secondly by using "difficult" (non-similar) examples.

When we use a sampling period of 3 feet, one model of 1000 feet only produces about 300 examples. This is what we did in Test 12. To create a training set of the same size as our original training set we use 15 models with this sampling period. This training set is more representative, because it contains more different examples. We expect this will make it more difficult for the net to learn the problem: the network has to find a more general solution to fit all the examples. This improves the generalization of the network.

After training the net, we have looked at the approximation the net gives for an arbitrary log. We found that the highest relative errors occur around areas of low resistivity and areas with high resistivity contrasts. The training set would be more representative if it contained more examples like this. Actually this is some kind of weighing of the examples. By offering more "difficult" examples, these examples will get more attention in the minimization process. In Test 13 we use a training set of 4600 examples, coming from difficult parts of logs A, P and S, and 1436 coarser sampled examples of the other parts of log A (with a total of 6036 examples as training set). The results for these tests are given in Table 13 and Table 14.

As we can see from these tests, both methods improve the generalization performance of the net. The training, however, becomes more complicated. It is very important to create a training set that contains a large number of different examples that look like examples that are found in reality.



Table 13: Comparing a training set of target logs and a training set of coarser sampled target logs.

                          Test 1       Test 12
  architecture            3x128.10.1   3x128.10.1
  number of weights       3861         3861
  training set            model A      mixture of 15 models
  number of epochs        3060         3023
  training error          3.1%         5.6%
  generalization error    70.0%        26.9%

Table 14: Comparing a training set of target logs and a training set of difficult parts of target logs.

                          Test 1       Test 13
  architecture            3x128.10.1   3x128.10.1
  number of weights       3861         3861
  training set            model A      difficult examples of models A, P and S
  number of epochs        3060         1042
  training error          3.1%         9.8%
  generalization error    70.0%        30.3%



4.5 Intermediate results for the input representations

We select some of the nets from the previous tests to train them on a large training set. The training set for the Shallow-log is constructed from the models A, B, C, D, E and F. The target logs are sampled every 0.2 feet, so the training set contains 28331 examples. We use the models G, H and I as validation set (14153 examples) and the models J, K, L, M, N, O, P, Q, R and S as test set (47485 examples).

In Figure 35 the results for three nets are shown. The first six models are the models the net is trained on; the other models are used for validation and testing. The first net, Non-uniform, is a net with a non-uniform sampled input and PCA reduction from 3x64 inputs to 35 inputs for Rt and 20 inputs for both Rxo and dxo. The second net, Haar from Test 9, is a net with Haar coefficients as inputs; the inputs are reduced by removing some of the coefficients, from 3x128 inputs to 3x68 inputs. And the third net, Uniform from Test 4, is a uniform sampled input with a window of 29.7 feet and a sampling period of 0.3 feet, resulting in 3x100 inputs.

In Figure 35 the percentage of the log that has a relative error below 5% is shown. As can be seen in Figure 35, the non-uniform sampled input gives the best results on the training set, but all three nets perform equally over the validation and testing sets.

The methods to reduce the number of inputs work very well. The networks are able to learn the problem even when a large number of the inputs is removed. The generalization performance, however, is not sufficient, and we have also lost information. In the next sections we will investigate certain architectural constraints in order to improve on the generalization performance.



[Figure 35: Intermediate results of training three different neural nets (Non-uniform, Haar and Uniform) on 6 models: the percentage of the log that is correct (45 to 100%) on the training, validation and testing sets.]



4.6 Experimenting with architecture constraints

In the previous tests we only used fully connected nets. Now we will look at locally connected nets and other connection constraints. An attractive method to reduce the number of weights is forcing certain weights to be equal. Of course we have to be very careful when we use this type of constraint, because it could also make the learning more difficult and decrease the generalization performance. This occurs when the weight sharing forces the net to find a solution that does not fit the problem. It will only help the training and generalization when the weight sharing is a logical consequence of the problem.

4.6.1 Experimenting with fully connected nets

As we have seen, the fully connected nets can be trained quite well when a sufficient number of (different) examples is given. We train different fully connected nets with uniform sampled inputs and one or two hidden layers in Test 1 and Test 3. The results of these tests are given in Table 15. We find that two hidden layers improve the training results, but it is not feasible to train a net with that many weights on a large training set.

4.6.2 Experimenting with locally connected nets

The advantage of locally connected nets over fully connected nets is that they need much fewer weights and the neurons in the hidden layer can specialize on their part of the input. The size and overlap of the receptive fields are things to be determined. We have tried two locally connected nets in Test 15 and Test 16. The results are given in Table 16.

The results are better when we use small receptive fields and a sufficient number of hidden nodes on the second hidden layer. When we compare the net from Test 16 to the fully connected net from Test 3, we have obtained a weight reduction of 63%. Although the training error is slightly worse, the generalization, which is more important, is a lot better. We find that locally connected nets are more attractive (in this problem) than the fully connected nets.

4.6.3 Using symmetry constraints

In the following tests we have built in the symmetry constraints, as described in Section 2.3.3, in the fully connected net (Test 17 and Table 17), the locally connected net (Test 18 and Table 18) and the wavelet nets (Test 19 and Table 19).



Table 15: Comparing different fully connected nets.

                          Test 1       Test 3
  architecture            3x128.10.1   3x128.10.10.1
  number of weights       3861         3971
  number of epochs        3060         6429
  training error          3.1%         1.1%
  generalization error    70.0%        65.2%

Table 16: Comparing different locally connected nets.

                           Test 15             Test 16
  connections              Locally connected   Locally connected
  architecture             3x128.12.5.1        3x128.19.15.1
  number of weights        1523                1475
  receptive field size     7.8 feet            3.8 feet
  receptive field overlap  80%                 70%
  number of epochs         2679                5267
  training error           7.5%                2.1%
  generalization error     61.7%               48.9%

Table 17: Comparing fully connected nets without and with symmetry constraints.

                          Test 1       Test 17
  architecture            3x128.10.1   3x128.10.1
  number of weights       3861         1941
  symmetry constraints    No           Yes
  number of epochs        3060         655
  training error          3.1%         6.6%
  generalization error    70.0%        37.7%


In all tests the number of weights is reduced by approximately 50%. This makes the training more difficult for the net, but it improves the generalization performance. This idea of symmetry constraints is worked out further in the convolutional-regression nets.

4.6.4 Experimenting with convolutional-regression nets

In the convolutional-regression nets we constrain all receptive fields to share the same weights. The size and the overlap of the fields determine the accuracy in spatial resolution. The second hidden layer combines the activations from the first hidden layer and determines the location of the feature in the input.

Figure 36: Convolutional-regression net. The feature maps in the first hidden layer are connected to all the variables in the input layer. (Sketch of the net: the input layer holds Rt, Rxo and dxo in a sliding window of 29.8 feet; each receptive field feeds 6 feature maps with shared weights within each map; a second hidden layer of 15 nodes connects to the output node.)

The number of feature maps determines how many features can be detected. We should not use too few feature maps, otherwise the net has difficulty in learning the problem and in generalization. The number of weights, however, depends upon the number of feature maps used (for 15 hidden nodes in the second hidden layer, one feature map of k nodes increases the number of weights by k x 15). A trade-off should be made between the size and overlap of the fields, the number of feature maps to use and the resulting number of weights in the net.
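The weight counts reported below can be reproduced from this bookkeeping. The following sketch counts the free weights of the net of Test 21 (see Table 20 below); the assumption of one bias per neuron is mine, but the total matches the 2827 weights in the table.

    # Free weights of the Test 21 convolutional-regression net: 3x150 inputs,
    # receptive fields of 3.8 feet at a 0.2 feet sampling period, 6 feature
    # maps of 27 nodes, a second hidden layer of 15 nodes and 1 output node.
    variables = 3
    field = round(3.8 / 0.2) + 1              # 20 samples per receptive field
    maps, nodes_per_map, hidden2 = 6, 27, 15

    kernels = maps * (field * variables + 1)            # shared weights: 366
    combine = maps * nodes_per_map * hidden2 + hidden2  # second layer: 2445
    output = hidden2 + 1                                # output node: 16
    print(kernels + combine + output)                   # 2827, as in Table 20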



Table 18: Comparing locally connected nets without and with symmetry constraints.

                            Test 16             Test 18
  connections               locally connected   locally connected
  architecture              3x128.19.15.1       3x128.19.15.1
  symmetry constraints      no                  yes
  number of weights         1475                905
  receptive field size      3.8 feet            3.8 feet
  receptive field overlap   70%                 70%
  number of epochs          5267                5536
  training error            2.1%                2.3%
  generalization error      48.9%               37.8%

Table 19: Comparing the "wavelet" net without and with symmetry constraints.

                            Test 9                  Test 19
  remarks                   wavelet coefficients,   wavelet coefficients,
                            reduction 20,10         reduction 20,10
  architecture              3x68.15.1               3x67.15.1
  symmetry constraints      no                      yes
  number of weights         3091                    1561
  number of epochs          4303                    3137
  training error            0.8%                    3.6%
  generalization error      51.5%                   46.6%


Figure 37: Convolutional-regression net. The first hidden layer consists of three sets of feature maps. Each set consists of three maps and is connected to one of the variables in the input layer. (Sketch of the net: 9 feature maps in total, with shared weights within each map; window size 29.8 feet; a second hidden layer of 15 nodes.)

In Test 20, 21 and 22 the hidden layer was connected to all the variables of the input layer, as shown in Figure 36. In Test 23, on the other hand, the first hidden layer was split into three variable maps, each containing three feature maps. Each variable map was connected to its corresponding variable in the input layer. This is shown in Figure 37.

The results for these tests are given in Table 20 and Table 21. For all these tests the nets are locally connected and the window size is 29.8 feet. The convolutional-regression nets perform very well on both the training and the test set. The generalization performance of these nets is significantly better than for all the nets we have tried before. When we compare the results from Test 21 with the results from Test 3 for example (Test 3 is a fully connected net with two hidden layers), we have a weight reduction of 29% and the generalization is approximately 2.5 times better.



We have investigated the features that are detected by the first hidden layer. In Figure 39 the activations of the first hidden layer at a certain depth are plotted. These activations come from Test 21. From these activations we can see that the first hidden layer is actually detecting the bed boundaries. It smooths the input signal and the activations are constant for parts of the input signal that are constant.
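Functionally, each feature map of this first hidden layer is a strided one-dimensional convolution over the sliding window. Below is a minimal numpy sketch with the Test 21 geometry (150 samples per variable, fields of 20 samples, 75% overlap, hence a stride of 5 samples); the random kernel merely stands in for the learned set of shared weights.

    import numpy as np

    rng = np.random.default_rng(1)
    window = rng.normal(size=(3, 150))   # Rt, Rxo and dxo in the sliding window
    kernel = rng.normal(size=(3, 20))    # shared weights of one feature map
    bias, stride = 0.1, 5                # 75% overlap of 20-sample fields

    # One activation per receptive field; the same kernel is used everywhere.
    acts = [np.tanh(np.sum(kernel * window[:, i:i + 20]) + bias)
            for i in range(0, window.shape[1] - 20 + 1, stride)]
    print(len(acts))                     # 27 nodes, as in the 6x27 feature maps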



Table 20: Results for convolutional-regression nets (1).

                            Test 20        Test 21
  architecture              3x60:6x9:5:1   3x150:6x27:15:1
  number of weights         503            2827
  sampling period           0.5 feet       0.2 feet
  receptive field size      5.5 feet       3.8 feet
  receptive field overlap   75%            75%
  figure                    Figure 36      Figure 36
  number of epochs          3316           2865
  training error            3.9%           0.6%
  generalization error      34.7%          25.5%

Table 21: Results for convolutional-regression nets (2).

                            Test 22           Test 23
  architecture              3x100:6x16:15:1   3x150:9x27:15:1
  number of weights         1927              3865
  sampling period           0.3 feet          0.2 feet
  receptive field size      7.2 feet          3.8 feet
  receptive field overlap   80%               75%
  figure                    Figure 36         Figure 37
  number of epochs          4081              3689
  training error            1.6%              0.5%
  generalization error      26.3%             28.6%



Figure 38: True resistivity profile of model A at depth 615 feet. (Rt in Ohm meter against depth, 600 to 630 feet.)

Figure 39: Activation of first hidden layer per feature map for Test 21. These activations are for model A at depth 615 feet. This layer consists of 6 feature maps of 27 nodes each. (Activation between -1 and 1 per node, for feature maps 1 to 6.)



4.7 Intermediate results architecture design

We train a convolutional-regression net on a larger set of models. The training set, validation set and test set are equal to the ones used for the previous networks (28331, 14153 and 47485 patterns respectively). The net that is used is shown in Figure 36. The performance in comparison with the previous results is shown in Figure 40. As one can see, the generalization performance of the convolutional-regression net is much better than for the previous nets. We are going to use this type of net for the approximation of both the Deep- and the Shallow-log.

4.8 Summary and results

The nets with the best performance (training and generalization error) are the convolutional-regression nets. We train a net on the Shallow-log and one on the Deep-log. We use the convolutional-regression net from Test 21. This net has a sliding window of 29.8 feet, sampled every 0.2 feet (3x150 inputs), 27 receptive fields of 3.8 feet with 75% overlap and 6 feature maps. We perform the following experiments:

- Shallow-log (1): We train a convolutional-regression net on earth models with invasion (models A, B, C, D, E and F). The target logs are sampled every 0.2 feet. The results are given in Table 22.

Table 22: Average relative error and performance for earth models with invasion (Shallow-log).

  set              number of patterns   average relative error   percentage of log predicted correct
  training set     28331                4.3%                     88% correct
  validation set   14153                5.1%                     82% correct
  test set         47485                6.2%                     77% correct

- Shallow-log (2): After this training we add models without invasion and some models with invasion and restart the training. For this purpose we use the whole training set (see Table 5) (the target logs are now sampled every 1.0 feet) and 40 models without invasion (target logs were sampled every 0.4 feet). The results are shown in Table 23. All models contain either invasion or no invasion. Therefore, the performance is split into invasion and no invasion.



Figure 40: Intermediate results (2) of training different neural nets on 6 models. (Bar chart: percentage of the log that is correct on the training, validation and test sets, for the Non-Uniform, Haar, Uniform and Convolutional-regression nets.)

Table 23: Average relative error and performance for earth models with and without invasion (Shallow-log).

  set                      number of patterns   average relative error   percentage of log predicted correct
  training set             61688                5.3%                     76% correct
  validation set           14153                5.5%                     75% correct
  test set, invasion       14128                5.6%                     77% correct
  test set, no invasion    10040                --                       74% correct



- Deep-log: We train a convolutional-regression net on both models with and without invasion. We use the same training set as for the Shallow-log. The results are given in Table 24.

Table 24: Average relative error and performance for earth models with and without invasion (Deep-log).

  set                           number of patterns   average relative error   percentage of log predicted correct
  training set                  61688                8.4%                     52% correct
  validation set, invasion      14153                8.8%                     51% correct
  validation set, no invasion   14128                9.0%                     50% correct
  test set, invasion            --                   7.0%                     56% correct
  test set, no invasion         10040                6.9%                     58% correct

The last two nets are not optimal in the sense that they are not trained from scratch. The net that was trained on the Shallow-log was optimal for models with invasion. Then we added models without invasion and new models with invasion. The net that was trained on the Deep-log used the optimal set of weights from the first net that was trained on the Shallow-log (this was done because the Shallow-log and Deep-log look very similar in practice) and continued its training on new models with and without invasion. Better results can be achieved by retraining the nets (random initialization of the weights) and directly using a large training set.



5 The neural network as fast forward model

In this section we present the neural networks that can be used as fast forward models. The accuracy is not yet sufficient, but can be improved by training on a more representative training set. For each net the "input domain" is given. This is a description of the models the net is trained on. For earth models similar to these training models, the net gives an approximation with an accuracy that is also given in this description. When an earth model lies outside this domain, one cannot expect the net to perform optimally.

For each net the best and worst test result is given. These results consist of (a part of) the earth model (only for models with invasion), the approximation of the log by the neural net and a cumulative error plot. In this cumulative error plot the percentage of the log that has a relative error below R% (for R = 0...25) is given. In all plots the invasion radius is given in 0.1 inch and the resistivities in Ohm m.

5.1 Application to earth models without invasion

A description of this net can be found in Chapter 3. The net is fully connected. The input domain for the net that is trained on the Deep- and the Shallow-log, here called NoInvasion, is given in the following table:

  net                    NoInvasion
  tool response          Deep-log and Shallow-log
  size sliding window    15 feet
  size training set      11797 examples
  range Rt               1...70 Ohm m
  range bed size         1...5 feet
  speed                  400 pnt/sec
  accuracy Deep-log      6.0%
  accuracy Shallow-log   2.2%

In Figures 41 and 42 the worst and best test results are shown for the Deep-log approximation of the net NoInvasion. From the cumulative error plots for these approximations one can see that the deviation in performance is quite large. In the worst case only 8% of the log has a relative error below 5%, but in the best case this is 100%. In Figures 43 and 44 the worst and best test results are shown for the Shallow-log approximation of the net NoInvasion. The deviation in performance is much less than in the case of the Deep-log. In the worst case 75% of the log has a relative error below 5% and in the best case this is 100%.
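Both quantities shown in these plots are straightforward to compute from a pair of logs. Below is a small sketch with synthetic stand-ins for the network output and the target log (both arrays are hypothetical):

    import numpy as np

    rng = np.random.default_rng(2)
    target = 10 ** rng.uniform(0, 2, size=500)         # synthetic log, 1...100 Ohm m
    net = target * (1 + 0.05 * rng.normal(size=500))   # noisy approximation

    rel_err = 100 * np.abs(net - target) / np.abs(target)
    print("average relative error: %.1f%%" % rel_err.mean())

    # Cumulative error curve: percentage of the log with relative error below R%.
    for R in range(5, 26, 5):
        print("below %2d%%: %5.1f%% of the log" % (R, 100 * np.mean(rel_err < R)))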



Figure 41: Worst case neural net approximation of the Deep-log. Average relative error is 14.2%. (Top: true resistivity Rt, the Deep-log and the approximation by the neural net against depth. Bottom: percentage of the log with relative error below R%, for R = 0...25.)



Figure 42: Best case neural net approximation of the Deep-log. Average relative error is 2.6%. (Same layout as Figure 41.)



Figure 43: Worst case neural net approximation of the Shallow-log. Average relative error is 8.5%. (Same layout as Figure 41, with the Shallow-log.)



Figure 44: Best case neural net approximation of the Shallow-log. Average relative error is 1.2%. (Same layout as Figure 41, with the Shallow-log.)



5.2 Application to earth models with invasion

A description of this net can be found in Chapter 4. This is a convolutional-regression net. The following net only produces an approximation of the Shallow-log. It is trained on earth models as described in the following table:

  net                    Invasion
  tool response          Shallow-log
  size sliding window    30 feet
  size training set      28331 examples
  range Rt               1...71 Ohm m
  range Rxo              0.5...2.5 Ohm m
  range dxo              8...50 inch
  range bed size         1...20 feet
  speed                  90 pnt/sec
  accuracy Shallow-log   6.2%

In Figure 45 two of the test models are shown. The model at the top is difficult for the neural net and the model at the bottom relatively easy. In Figures 46 and 47 the worst and best test results for the Shallow-log approximation of the net Invasion are shown. The cumulative error plot of the first approximation shows that 70% of the log is approximated with a relative error below 5%. In the cumulative error plot of the second approximation one can see that in the best case 83% of the log is approximated with a relative error below 5%. This network is optimal for the earth models it was trained on.

5.3 Application to realistic earth model

The domain of earth models the convolutional-regression net FM is trained on is a combination of the previous models without and with invasion. A remark should be made that the value of Rxo was taken to be 1.1 Ohm m when dxo was 4.25 inch. In Figure 48 a formation model is shown that comes from a real oil well. This model is more realistic. It does not lie in the input domain, because it contains both parts with and without invasion; in our training set each model only contained either invasion or no invasion. The beds are relatively small. The Deep-log and Shallow-log look very similar for this model, which is also different from the models the net is trained on. All in all, this model looks quite different from the models in the training set and we do not expect the net to be optimal for this model.



A description of the earth models the net was trained on is given in the following table:

  net                    FM
  tool response          Deep-log and Shallow-log
  size sliding window    30 feet
  size training set      61688 examples
  range Rt               1...71 Ohm m
  range Rxo              0.5...2.5 Ohm m
  range dxo              4.25, 8...50 inch
  range bed size         1...20 feet
  speed                  85 pnt/sec
  accuracy Deep-log      9.0%
  accuracy Shallow-log   8.0%

The approximation of the Deep-log is shown in Figure 49 and the approximation of the Shallow-log in Figure 50. As we can see from the cumulative error plots, the relative error between the neural net response and the Deep-log is below 5% for 67% of the log. For 88% of the log the relative error lies below 10%. The Shallow-log approximation is much better: here 77% of the log has a relative error below 5%.
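Since none of these nets can be trusted outside the input domain it was trained on, a range check against the table above is a cheap guard before using the net FM. Below is a minimal sketch; the helper, its name and the model encoding are illustrative, and dxo is treated as a single interval although the training set only contained 4.25 inch and 8...50 inch:

    # Check whether an earth model lies inside the input domain of the net FM.
    FM_DOMAIN = {
        "Rt":       (1.0, 71.0),    # Ohm m
        "Rxo":      (0.5, 2.5),     # Ohm m
        "dxo":      (4.25, 50.0),   # inch (4.25 or 8...50 in the training set)
        "bed_size": (1.0, 20.0),    # feet
    }

    def outside_domain(model, domain=FM_DOMAIN):
        """Return the variables of the model that fall outside the domain."""
        return [name for name, (lo, hi) in domain.items()
                if not all(lo <= v <= hi for v in model[name])]

    model = {"Rt": [5.0, 40.0], "Rxo": [1.1], "dxo": [4.25], "bed_size": [3.0]}
    print(outside_domain(model) or "model lies inside the trained domain")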



Figure 45: Examples of earth models used for the net Invasion. (Two models: true resistivity Rt, invasion resistivity Rxo and invasion radius dxo against depth, 200 to 300 feet.)



Figure 46: Worst case neural net approximation of the Shallow-log. Average relative error is 7.5%. (Top: the Shallow-log and the approximation by the neural net against depth. Bottom: percentage of the log with relative error below R%, for R = 0...25.)



Figure 47: Best case neural net approximation of the Shallow-log. Average relative error is 4.5%. (Same layout as Figure 46.)



Figure 48: A (realistic) earth model. (True resistivity Rt, invasion resistivity Rxo and invasion radius dxo against depth, 5400 to 5500 feet.)



Figure 49: Neural net approximation of the Deep-log. Average relative error is 7.6%. (Top: the Deep-log and the approximation by the neural net against depth. Bottom: percentage of the log with relative error below R%, for R = 0...25.)



Figure 50: Neural net approximation of the Shallow-log. Average relative error is 8.3%. (Same layout as Figure 49, with the Shallow-log.)


6 Conclusions

In this section we present the conclusions following from the experiments we have performed and the methods we have investigated. The conclusions are split into three parts:

1. A final conclusion about the goal of this project: investigating whether it is feasible to use a neural network in the forward modelling process.

2. Conclusions about the used methods. This involves the input representation, the techniques that are used to reduce the number of inputs and the architectural constraints.

3. Conclusions about the use of the neural network in training and testing. These conclusions are useful in further investigation and in the use of the trained network in other applications.

6.1 Neural network as fast forward model?

In the first part of the project we only used earth models without invasion and in the second part we only used earth models with invasion. For both types of earth models neural nets were trained, with the following results:

- No invasion: We use a "standard" fully connected net with one hidden layer containing 15 nodes, a sliding window of 15 feet and a sampling period of 0.2 feet (75 inputs). The network has more trouble in learning the Deep-log than the Shallow-log. This is caused by the shoulder bed effect, which is more pronounced in the Deep-log than in the Shallow-log. The performance is given in the following table:

  target log    data set       percentage of log with      average
                               relative error below 5%     relative error
  Shallow-log   training set   98%                         1.7%
                test set       96%                         2.2%
  Deep-log      training set   62%                         5.1%
                test set       61%                         6.0%




- Invasion: We trained one convolutional-regression net on the Shallow-log. The net uses a sliding window of 29.8 feet, sampled every 0.2 feet, 27 receptive fields of 3.8 feet with 75% overlap and 6 feature maps. The performance is given in the following table:

  target log    data set         percentage of log with      average
                                 relative error below 5%     relative error
  Shallow-log   training set     88%                         4.3%
                validation set   82%                         5.1%
                test set         77%                         6.2%

- Mixed invasion and no invasion: We used the same convolutional-regression net as in the previous tests with a training set consisting of earth models with and without invasion (but not mixed). This time we trained the net on both the Deep- and the Shallow-log. These nets are not optimal, because first the net was minimalized on models with invasion and then we added more models with and without invasion. In the following table the performance of the nets is given for the models with and without invasion:

  target log    data set                      percentage of log with      average
                                              relative error below 5%     relative error
  Shallow-log   training set                  76%                         5.3%
                validation set                75%                         5.5%
                test set, invasion            77%                         5.6%
                test set, no invasion         74%                         --
  Deep-log      training set                  52%                         8.4%
                validation set, invasion      51%                         8.8%
                validation set, no invasion   50%                         9.0%
                test set, invasion            56%                         7.0%
                test set, no invasion         58%                         6.9%



- Real logging data: Real earth models contain both layers with and without invasion. The trained convolutional-regression networks from the mixed invasion and no invasion part were tested on a realistic earth model. The performance of these nets on the real logging data is:

  target log    percentage of log with      average
                relative error below 5%     relative error
  Shallow-log   77%                         8.3%
  Deep-log      67%                         7.6%

  The network performance can be improved by using more models like this (partly with and partly without invasion) and by choosing the invasion radius between 4 and 25 inch instead of 8 and 50 inch.

- Approximation time: The goal of the project is to create a faster forward model. The neural network is approximately 100 times faster than the forward model that is presently used at KSEPL (both times measured on an IBM R6000 workstation). This could still be improved by optimization of the neural net calculations.

Although the accuracy of the approximation still needs some improvement, the network is a lot faster than the forward model that is used now. We conclude that it is feasible to use a neural network in the forward modelling process. Even with less accuracy the neural network could be used in the first iterations of the forward modelling process. In that way a fairly good initial guess can be made very quickly. Then one can use the more accurate forward model.

6.2 Methods

In the first part of the project we only used fully connected nets, with varying numbers of layers and hidden nodes and variations in the size of the sliding window. More interesting results are found in the second part of the project, where we used earth models with invasion.

6.2.1 Input representation

We experimented with two input representations. The first, the discretized sliding window approach, gave the best results (training and generalization) and has been used in further tests:

- Discretized sliding window: We used a uniformly sampled sliding window as input to the neural net. The number of inputs coming from this window can be quite high, especially in the case with invasion. The training result was good, but the network was not able to generalize well. (A sketch of this pattern construction is given after this list.)



- Attributes: Again we used a fixed size sliding window, but now we describe the beds that occur in this window. Each bed was described by a number of attributes. The problem, however, was not described well by these attributes. To find appropriate attributes is a difficult task.
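A sketch of how the sliding-window patterns can be built from a discretized earth model, using the no-invasion geometry (a 15 feet window at a 0.2 feet sampling period, 75 inputs); the moving-average stand-in for the tool response is purely illustrative:

    import numpy as np

    def make_patterns(rt, tool_log, window_pts=75):
        """Pair the window of Rt samples around each logging point with the
        tool response at that point."""
        half = window_pts // 2
        inputs = [rt[i - half:i + half + 1] for i in range(half, len(rt) - half)]
        return np.array(inputs), np.asarray(tool_log[half:len(rt) - half])

    rng = np.random.default_rng(4)
    rt = rng.uniform(1, 70, size=1000)                 # synthetic Rt samples
    tool = np.convolve(rt, np.ones(25) / 25, "same")   # stand-in tool response

    X, y = make_patterns(rt, tool)
    print(X.shape, y.shape)                            # (926, 75) (926,)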

6.2.2 Input reduction

All preprocessing methods we used were successful. We achieved a high input reduction without loss of performance (measured in training and generalization results):

- Sampling method: The sliding window can be sampled uniformly and non-uniformly. In the first case we experimented with different sampling periods. A coarse sampling period results in fewer inputs, and the training and generalization results were comparable; the bed boundaries, however, are described less accurately. The non-uniform sampling method worked very well. This method is based on the tool physics: the tool receives most of its information from the center of the window and less from the edges. Non-uniform sampling results in fewer inputs, and the generalization performance of the net was better than for the net that used a uniformly sampled sliding window as input. Here we only lose accuracy at the edges of the window.

- Principal components: With the projection of the N-dimensional input vector onto an M-dimensional vector, described by M principal components, an input reduction of 75% was achieved. The training results were comparable and the network generalized better than with the original discretized input, although approximately 7% of the information of the input was lost (measured in the relative distance between the original and the projected input).

- Haar transform: The advantage of transforming the original input values to wavelet coefficients is that a number of the coefficients can be removed. A reduction of 73% of the number of inputs was achieved with good training and generalization results. By removing coefficients at the edges of the window, we lose some accuracy: the bed boundaries outside the center of the window are described less accurately (see the sketch after this list).
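The sketch below illustrates the idea: run a full Haar decomposition of a sampled window and keep the finest-scale coefficients only around the centre, where the tool is most sensitive. The window length and the kept-centre width are illustrative, not the exact "Reduction 20,10" scheme used in the experiments.

    import numpy as np

    def haar(signal):
        """Full Haar decomposition of a length-2^k signal."""
        coeffs, s = [], np.asarray(signal, dtype=float)
        while len(s) > 1:
            coeffs.append((s[0::2] - s[1::2]) / np.sqrt(2))  # details, finest first
            s = (s[0::2] + s[1::2]) / np.sqrt(2)             # running averages
        coeffs.append(s)                                     # overall average
        return coeffs

    window = np.random.default_rng(3).normal(size=128)       # one sampled variable
    coeffs = haar(window)

    # Keep the finest-scale details only in the centre half of the window;
    # all coarser coefficients are kept, so bed boundaries near the centre
    # stay sharp while the edges are described less accurately.
    kept = len(coeffs[0]) // 2 + sum(len(c) for c in coeffs[1:])
    print("inputs per variable: %d -> %d" % (len(window), kept))   # 128 -> 96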



6.2.3 Architecture design

- Fully connected nets: In the case without invasion a fully connected net can be used. The generalization performance and the training results are comparable. When the beds contain invasion, we need three parameters to describe each bed. This type of network then does not generalize well. The number of connections in this net is very high, resulting in a long training time and storage and memory problems. Preprocessing methods can be used to reduce the number of inputs.

- Locally connected nets: A neural net that is locally connected has far fewer weights and better generalization than the fully connected nets. The training time needed for these nets is shorter than for the fully connected nets. The size of the receptive fields and the overlap are difficult to determine. We chose, however, small fields with large overlap. In this way it is easier for the net to determine the precise location of a feature that occurs in one of the fields. This net needs at least two hidden layers: one for detecting the local features and one for combining the found features.

- Symmetry constraints: An input signal and its mirror image give the same tool response and should therefore also give the same network response. This requirement leads to certain weight constraints in the fully connected, locally connected and wavelet nets. The number of weights is reduced, because certain weights are "shared" (equal). The reduction in the number of weights makes the training more difficult, but it improves the generalization performance in comparison with the same nets without these constraints.

- Convolutional-regression nets: In the convolutional-regression net all receptive fields share the same weights. One group of hidden nodes with shared weights is called a feature map. The reduction of freedom is compensated by an increase in the number of feature maps. This net performs a convolution on the input with a convolution kernel, the set of shared weights, that it has learned itself. The generalization performance of this type of net is better than for all the other nets we have tried.

6.3 Application of the convolutional-regression net

When the convolutional-regression net is going to be used in other applications, the following aspects are important:

- Size of the sliding window: The size of the sliding window depends on the application. One should have an idea of how much of the input log is responsible for the output.



- Sampling period of sliding window: The sampling period of the sliding window determines how accurately the input is described. A good indication is to use the same sampling period as for the target log.

- Receptive fields: The size and overlap of the receptive fields determine the accuracy of the location of the features in the input. Use small fields with high overlap. The number of weights, however, depends on the number of receptive fields. Choose the size of the fields and the overlap so that the number of weights is not too large.

- Representative training set: The training set is constructed from the target logs. Do not take a too dense sampling of the target log, otherwise most examples look too much alike. It is better to take a (very) coarse sampling period and use a large number of different logs. In this way the training set consists of a large number of very different examples. Create models that are likely to be found in reality.

- Size training set: The number of training examples that are needed in order to get good generalization performance depends upon the representativeness of the training set. When the training set contains a high number of different examples, one needs approximately 10 times the number of weights as examples.

- Minimalization network error: The error that the network is minimalizing can be adapted to the requirements of the approximation. With the combined logarithmic and normalization scaling the neural net is minimalizing the proportion d/a, where d is the desired output and a is the actual output (see the sketch below).
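A short sketch of why this scaling measures the ratio rather than the difference of the outputs; the 1...100 Ohm m scaling range is an assumed example, not the constants used in the experiments:

    import numpy as np

    lo, hi = np.log(1.0), np.log(100.0)   # assumed resistivity range, Ohm m

    def scale(r):                         # combined logarithmic + normalization
        return (np.log(r) - lo) / (hi - lo)

    d, a = 50.0, 40.0                     # desired and actual output, Ohm m
    err = scale(d) - scale(a)             # the difference the net minimizes
    print(np.isclose(err * (hi - lo), np.log(d / a)))   # True: error ~ log(d/a)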

The convolutional-regression net can easily be adapted for problems with more variables than Rt, Rxo and dxo. When there are for example dipping layers, the dip angle can be added as a fourth variable (for each sampling point). This is only possible when there is a different dip angle for each bed in the formation. The scaling of a new variable needs new investigation. The borehole radius and the resistivity of the drilling fluid can also be added as variables, but like the dip angle, this should only be done when there are enough different values.


A Neural network simulators

In this project we used the Xerion network simulator, versions 3.1 and 4.0 (van Camp 1993). We also tried other network simulators, the Stuttgart Neural Network Simulator (SNNS) (Zell 1993) and Aspirin/MIGRAINES (Leighton 1992), but we found that Xerion provides more freedom in specifying architecture constraints. The Xerion simulator allows you to alter the network design on any level: layers, nodes and even connections. This is very useful for the implementation of the convolutional-regression nets, where we constrain certain weights to be equal.

The SNNS simulator is a beautiful graphical simulator and has a special option for time delayed neural nets (see Section 2.3.5). A layer is specified by its number of features (or variables) and its total delay length (number of nodes in the sliding window). Layers can be fully or locally connected. Receptive fields are specified by their size (delay length) and always have maximum overlap (displacement of one node). All receptive fields have shared weights. It is, however, not possible to constrain arbitrary weights to be the same. And it is also not possible to change the overlap of the receptive fields. We present the inputs as a sequence within a sliding window, but for the SNNS simulator one should present the inputs at one specific logging point. The advantage of this is that the file that contains the data is much smaller than in our case. There are only 4 values per pattern (Rt, Rxo, dxo and the tool response) instead of 3(w/s + 1) for a window size w and a sampling period s. The delay length specifies how many inputs before this input affect the output. One cannot specify how many inputs after this input affect the output. So there are three main points why this simulator is not suitable for our purposes:

1. It is not possible to specify arbitrary weight constraints.

2. The overlap of the receptive fields is always maximal.

3. One can only specify how many inputs before (and not after) the current input affect the output.

It is not possible to specify convolutional-regression nets with this simulator.

Aspirin/MIGRAINES is a compiler. It compiles your code into a working neural network simulator (in C). In the source code the neural net is specified by its components. A component is described by the name of the layer, the size of the layer (in nodes) and connection information (how the layer is connected to other layers). Receptive fields with shared weights are called Shared Tessellations, and one can specify the size of the fields and the overlap in the x- and y-direction. It is not possible, however, to constrain specific weights to be shared, which we needed in the symmetry experiments.




It is possible to create convolutional-regression nets with this simulator.

Although receptive fields are more easily defined in the previous simulators, we have used the Xerion simulator. All hidden neurons in one feature map are connected by n x m links to their corresponding receptive field (of size n x m). All these connections have to be specified, which can mount up to 1620 connections in the convolutional-regression net. Neurons can be constrained to have the same incoming links, which is what we did per feature map. Then the nodes in one feature map are constrained to have the same weights. Our Xerion topology files contained a total of 9951 lines. The reading of this file and the building of the net is quite slow, but it outweighs the lack of freedom of the other simulators.



References

Anderson, B. & Barber, T. (1990). Modelling electromagnetic tool response, Oilfield Review, pp. 22-32.

Asquith, G. (1982). Basic well log analysis for geologists.

Chemali, R., Gianzero, S. & Strickland, R. (1983). The shoulder bed effect on the dual laterolog and its variation with the resistivity of the borehole fluid, SPWLA 24th annual logging symposium.

Chemali, R., Gianzero, S. & Su, S. (1988). The dual laterolog in common complex situations, SPWLA 29th annual logging symposium.

Gianzero, S. (1977). Characteristic responses of resistivity tools in elliptical boreholes, IEEE Transactions on Geoscience Electronics GE-15(4): 251-256.

Gianzero, S., Lin, Y. & Su, S. (1985). A new high-speed hybrid technique for simulation and inversion of resistivity logs.

Guyon, I. (1991). Neural networks and applications tutorial, Physics Reports (Review section of Physics Letters) 207(3-5): 215-259.

Haykin, S. (1994). Neural Networks, A Comprehensive Foundation, Macmillan College Publishing Company, Inc.

Hertz, J., Krogh, A. & Palmer, R. (1991). Introduction to the Theory of Neural Computation, Addison-Wesley Publishing Company.

Leighton, R. R. (1992). The Aspirin/MIGRAINES Neural Network Software, v6.0 edn.

Moran, J. (1985). Focused resistivity logs, Developments in Geophysical Exploration Methods 6: 225-260.

Strang, G. (1989). Wavelets and dilation equations: A brief introduction, SIAM Review 31(4).

van Camp, D. (1993). A User's Guide for the Xerion Neural Network Simulator, ver. 3.1 & ver. 4.0 edn, University of Toronto.

Waibel, A., Hanazawa, T., Hinton, G., Shikano, K. & Lang, K. (1989). Phoneme recognition using time-delay neural networks, IEEE Transactions on Acoustics, Speech and Signal Processing 37(3).

Zell, A. (1993). SNNS Stuttgart Neural Network Simulator, v3.1 edn, Institute for Parallel and Distributed High Performance Systems.

