I spent the last few days fixing the site map that is generated from running Xenu. The generated site map starts with a great many redundant page listings, so I have to go in and remove a lot of them. The current page size for the sitemap is 505kB, while the one already on the web is 462kB.
I'll go ahead and upload the new map and up-date over the rest of the week. My system indicates 1143 html files [pages], as does Google. Google indicates I have 1444 indexed urls, and a total of 1606 total URLs. The difference is due to a number of orphan pages that capture misspelled page addresses.
Any way the 80 odd pages that were generated over the last few months are now part of the sitemap, and now have an external page link that points to them.
7 comments:
3/11/08; the sitemap file has been reduced to 492kB, or down 13kB, a lot for a text file. I'll post the next up-date.
3/14/08; now the site map is at 423kBytes, or down another 70k Bytes.
3/17/08 Now the site-map is down to 367k Bytes. Down another 60 KB. It still needs some work, and I would like to get it down near 300k by the end of the week.
I need to be careful and only deleted redundant links and not mess up the tree structure of the map.
It should load faster now.
3/17/08 Still the same day as the last comment, but it's the best I could do. The site-map is now down to 304k Bytes. As the site map becomes more optimized it becomes harder to delete redundant page listings.
Some times the software program places the link off-page, or picks a non-important page; err, the link is there, but what page is the tab coming from.
Any why; you have to edit the page to understand; so the better the page gets the harder it is to update it, or you run the risk of deleting page addresses that do not reside under another listing.
What ever it's down another 60 kilo bytes, and that's with my main computer not working, seems the hard drive is clicking and will not load the OS.
This may be the most efficient sitemap todate.
4/1/08; Now the physical size of the sitemap is down to 264k bytes, or about half the size it started at. The new pages added to the web site over the last few weeks have also been loaded into the sitemap, so it is also up to date.
4/26/08 Even as new links are being added to the site map, the sitemap is still be reducing in size. The current size of the site-map is 258kBytes
5/16/08 Now down to 253K bytes, even as I add more listings. Another 5k bytes of reduction in text.
Post a Comment