studiind log-ul am gasit urmatoarele:
2006-03-15 10:10:06 69.49.230.8 - 82.79.34.18 80 GET /index.asp - 200 - -
2006-03-15 10:27:54 66.249.65.16 - 82.79.34.18 80 HEAD /pics/catel.jpg - 404 Googlebot-Image/1.0 -
dupa care urmeaza o succesiune de "IP de google... HEAD ... 404 Googlebot-Image/1.0" la diferite intervale de timp (1h, 2h, 5h). fisierele pe care le cauta google cu HEAD sunt din vechea pagina care nu mai este de 2 luni.
69.49.230.8 apartine:
OrgName: Hosting-Network GmbH
OrgID: HOSTI-3
Address: 247 Mitch Lane
City: Hopkinsville
StateProv: KY
PostalCode: 42240
Country: US
observati metoda HEAD in loc de GET.
am cautat diferentele intre HEAD si GET si iata ce am gasit:
"The HEAD method is identical to GET except that the server MUST NOT return a message-body in the response. The metainformation contained in the HTTP headers in response to a HEAD request SHOULD be identical to the information sent in response to a GET request. This method can be used for obtaining metainformation about the entity implied by the request without transferring the entity-body itself. This method is often used for testing hypertext links for validity, accessibility, and recent modification.
The response to a HEAD request MAY be cacheable in the sense that the information contained in the response MAY be used to update a previously cached entity from that resource. If the new field values indicate that the cached entity differs from the current entity (as would be indicated by a change in Content-Length, Content-MD5, ETag or Last-Modified), then the cache MUST treat the cache entry as stale. "
sursa: http://www.w3.org
deci am tras concluzia ca googlebot a vrut numai sa vada daca mai sunt actuale referintele pe care le avea din vechea pagina si a vazut ca nu sunt. insa de azi pagina mea www.klin.ro nu mai apare pe pozitia fruntasa pe care o ocupa pana ieri dupa "incaltaminte copii". daca am dat cache:www.klin.ro am vazut varianta din 9 martie, dar dupa cum am mai lamurit pe alte threaduri din forum, exista mai multe datacentere ale google si se pare ca fiecare are alta varianta.
intrebare: are legatura .. HEAD .. 404 Googlebot-Image/1.0 cu faptul ca am cazut din search sau nu? fisierele care le cauta sunt mult mai vechi decat versiunea pe care o indexase ultima data.