Please Donate £2



Click On DONATE
To Keep This Site Alive

Net Hosted

Net Hosted (.co.uk) is a first class Web Hosting Provider that have all the necessary features to host your website. They are the company hosting this website. I know they have Excellent Customer Support.

CREATION

Main web page of Website Creation Help.INDEX - Main WCH Page
Reasons why you should invest in a website.Why Invest In A Website?
Explains the important features you should look for in a web hosting package before buying a website.Research Host Features
How to research and view a Domain Name through the WhoIs lookup service. Advice on choosing a Domain Name.Domain Name Advice
Basic keyword advice/tips and tools/services that can help with your Keyword/Keyphrase research.Keyword Research
Explains some types of software needed for website creation and their costs.Website Software Costs
Some common Web Terminology (Website Jargon) explained.Website Terminology
Explains bandwidth, bandwidth usage and bandwidth distribution.Bandwidth And Usage
How to write a robotstxt.txt text file. Basic robotstxt instructions explained.Robotstxt Text File
An explanation of common Meta Tags.Meta Tags Explained
Website and Internet Jargon (Terminology) Explained.Website & Net Jargon
Free (Must Have) Downloads - (Free/Commercial Software/PHP Scripts).FREE Software & Scripts

 

CPanel

How to Login to the CPanel Control Panel - A description of CPanel Tools.CPanel Login & Tools
How to create a website email account (email address). Make your website business/project look more professional.Create Email Addresses
How to create an Auto-Response (Automatic Reply) Email. Avoid losing potential customers.Create Auto-Reply Email
How to create an empty MySQL Database with Privileges. Ideal for Personal Databases and for use with PHP Scripts.Create MySQL Database
How to create a table for an empty MySQL Database and then fill that table with records (rows of data). Also demonstrates the VTY Database Manager (PHP Script).Create MySQL Tables
How to create a FTP User Account (File Sharing Folder). People can upload/download files to/from your website (FTP Server).Create FTP, User, Account
How to create a Password Protected Directory (Folder). Ideal for Membership purposes.Create A Password Folder
How to obtain an EPP (Domain Name Transfer) Code. How to create a new website, folder, from an ADD-ON Domain Name.ADD-ON A Domain Name
How to park a domain name and redirect it to an existing domain name. Masking (Cloaking) explained.Park A Domain Name
How to create a subdomain - A shortened URL (website address) that is quicker to type into a web browser.Create Subdomain Name
How to configure a domain name for use on a different web hosting package (on different website servers/computers), so you can have domain names registered elsewhere but host them on your website.Change Name Servers

 

FEATURES

How to create a membership account, registration, login website using PHP Login Scripts and a MySQL Database.Create A Membership Site
How to create a PayPal PREMIER Account, so you can buy and sell goods online.Create A PayPal Account
How to create a PayPal BUY NOW button. An explanation of common button settings.Create A PayPal Button
How to create a Google Adsense (Advertisement Scheme) Account. Earn money from advertisements shown on your website.Create Adsense Account
How to create Google Adsense Advertisements (Adsense Units and Channels). Earn money from advertisements shown on your website. Monitor their success and failure!Create Adsense Adverts
How to create a sitemap xml file for the search engines, so they can crawl your unknown, newly uploaded, web pages.Create Sitemap XML File
How to manually install wordpress (download wordpress .zip file, set up MySQL Database configuration and upload wordpress files) to create a blog.1) Install Wordpress Blog
How to write and publish an article (post). How to create a Category for that post. General Dashboard (Control Panel) features/settings explained and/or exampled.2) Write An Article (Post)
How to import media into an article (post), create a link, create a page, edit a comment and more.3) Media, Link, Page, Etc
How to download and install a new wordpress blog theme. How to change the default wordpress blog theme (appearance).4) Change Default Theme
How to download and install the Forum (Questions & Answers Board) software called phpBB.1) Install A phpBB Forum
How to set up forum notice boards (questions and answers forums) with permissions.2) Set Up Forum Boards

 

TRAFFIC

Methods of investigating your competition's website, traffic, history.View A Website's History
Gives ideas for generating, more, traffic and reasons why existing traffic might be Curious Visitors only as opposed to being Interested, Buying, Visitors.Ways To Get More Traffic
How to submit a Website's URL (website address) to search engines, so that they can start indexing its content.Submit To Search Engines
Reasons why you should add Prices, Maps, References and Photos to a website.Why Add Prices & Media
Part One - Explains some of the realities of writing and submitting articles. The dis/advantages of submitting articles manually or via software submitter. Submitted article results. And much more.1) Articles - Basic Realities
Part Two - Explains about the Title, Content, Time Frames, Money and Feedback associated with article writing/submission.2) Articles - Basic Rules
Part Three - Shows my submitted articles results for September 2009 and October 2009, and then shows the overall results from July 2009 to March 2010.3) Articles - The Results
Highlights some of the common scams/techniques used by a bad Internet Marketer (Guru) to hook you into buying their Blueprint (Website Traffic/SEO/Search Engine/Etc Plan) and/or Tools.Avoid Scammer GURUs
Things to consider when deciding upon a screen resolution (screen size) to use for your website design/format/layout. The wrong screen size can lose you traffic.Website And Screen Size
Some DOs and DON'Ts advice with regards to forum traffic and the forum community.Forum Traffic Advice
Results and observations from splitting up a website into separate domains. Is one website better than three websites?Splitting Up A Website

This firefox add-on allows you to see your search results within other countries. A great traffic insight tool.Global Search Results
An excellent audio seminar telling it how it is traffic-wise.Traffic REALITY (Gold)

 

TOOLS

How to grab a colour from the screen, and edit a colour, using the colour picker/grabber called Color Cop. RGB, Hex and Decimal colour formats explained.Colour Grabber & Picker
How to take multiple snapshots (screen captures/photos) of the computer screen and save them automatically, as .bmp or .jpg files, with the screen grabber/snapshot program called Grabby.Take Snapshot Of Screen
How to set up and use the desktop screen recorder called Easy Screen Recorder. Records the desktop screen and microphone input as an .avi audio/video file. Easy to set up and use.Desktop Screen Recorder
How to set up and use Leawo Video Converter, so that you can convert video recordings into various video file formats. Video Codecs explained.Video File Converter
How to install the FTP Client (website file transfer program) called FileZilla.Install FileZilla FTP Client
How to set up a secure, ftpes, ftp connection (ftp account/website profile) for the FileZilla FTP Client.Set Up FTPES Connection
How to connect to your website (public_html) folder and upload folders/files to it, so they are live on the internet.Upload Files To Website
How to create a Google Analytics (Website Statistics) Account, so you can see visitor data and better cater for them.Create Analytics Account
How to install mozilla firefox web browser add-ons / plug-ins / Extension programs that can help with your website development.Install Firefox Add-Ons

 

CODING

How to embed a .flv flash video file inside a web page using JW Player or Flow Player.Embed A FLV Video File
One method of fixing the Internet Explorer 6 .png transparent image file problem.Fix IE6 PNG Transparency
Questionnaire Form (PHP Script) with Recaptcha. Example code with live questionnaire form given.Questionnaire Form

 

DESIGN

Which web browser(s) should you test your web pages with? Reasons why you should use.....Which Web Browser?

How To Write A Robots.txt Text File

Protect Your Website And Its Bandwidth

In a normal environment a Website Crawler (also known as a Web Spider or Web Robot) is a program or automated script, normally activated by a search engine, that scans your website (your public_html folder and the folders and files within it) collecting and analysing information about your website and its web pages (i.e. Number of Web Pages. Language used. Keywords and Phrases used) and their content (i.e. Number of Links. Code information. E-Mail Addresses. Is Audio/Video used?).

Some of that collected information is used by the search engine to give your website/web pages a ranking, position and subject matter (listing) within their search engine results, while the other information is used for Caching (storage) and Database purposes.



Search engine companies such as Google, Yahoo and Microsoft may also choose to pass on their collected information to third parties (i.e. other search engine companies). So a website crawler (web robot) is a good thing, in general. It searches your website for popularity, links, well written articles and other content in order to make your website/web pages viewable to as many people as possible via a search engine.

The downside to a website crawler though is that it tends to search every web page and folder within your public_html folder (your website). This is not good if you are using one of your folders as a "Members Only" area of your website, simply because the website crawler will collect information from the web pages inside that "Members Only" folder and more precisely keep a record of their location - inside your "Members Only" folder.

In turn, the search engines will list those "Members Only" web pages (as links). Add to this that I have mentioned A (one) website crawler, because I was giving you the definition of A (one) website crawler, when in fact a lot of search engines use their own website crawler these days and it means you now have many search engines listing your "Members Only" web pages (as links). The Robotstxt Database - Web Page Listing (or Robotstxt Database - Text List) lists over 300 website crawlers (web robots) for example, each with their given unique User Agent name (i.e. GoogleBot. MSNBot. Slurp. AskJeeves).

Fortunately, there is an answer to this problem but not a complete solution. It is called the robots.txt text file - A simple .txt (text) file, that can be created with a Text Editor such as Notepad, that allows you to give instructions to a website crawler. In particular, a DISALLOW instruction.

The DISALLOW instruction tells a website crawler not to scan certain folders and file types within your website (public_html folder). So you could disallow a website crawler from scanning your "Members Only" folder and the web pages within it for example. However. The reason why the robots.txt text file is not a complete solution is because it is not governed by any laws. Meaning. Website crawlers can ignore your robots.txt text file altogether and therefore ignore your DISALLOW instruction.

Once the robots.txt text file is created you upload (transfer) it to your public_html folder - A website crawler will read that robots.txt text file, if it wants to, before it scans your website (the content of your public_html folder).

To create a robots.txt text file begin by opening your favourite text editor (i.e. Notepad or Wordpad) and then type your instructions for the website crawler to obey (see below). From there. Save the text file with the filename robots.txt, using Notepad's (or Wordpad's) SAVE AS menu-item (Fig 1.1), before uploading (transferring) the text file to your public_html folder.




Fig 1.0  Open Notepad and then type your instructions for the website crawler to obey




Fig 1.1  Click on Notepad's FILE menu and select the SAVE AS menu-item to continue




Fig 1.2  Save the text as a text (.txt) file with the name: robots.txt




Fig 1.3  The Robots.txt text file when it has been uploaded to the public_html folder

In the above example Fig 1.0 shows the instruction User-agent: with a parameter of * (asterisk). This is followed by the instruction Disallow: with a parameter of /MembersOnly/. Together they are telling all User Agents (website crawlers) not to scan the content (folders and files) of the folder called MembersOnly.

USER  AGENTS  AND  ROBOTSTXT  INSTRUCTIONS

So far you have learnt that a website crawler is also known as a Web Robot, or Web Spider, and has a unique name known as a User Agent (otherwise known as a robot name or spider name). For example. The website crawler (web spider / web robot) with the user agent (robot name / spider name) of GoogleBot is the website crawler Google uses to scan your website (public_html folder) and return search engine results for the general public, based on information it scans from your website. The website crawler can, if it wants to, ignore your robots.txt text file though. Saying this. The major website crawlers do obey the instructions inside your robots.txt text file.

When a web page is shown in a search engine result as a link that web page is known as an Indexed web page. Its content (i.e. keywords and email addresses) has been scanned as normal but its path name and file name (full path name) has also been indexed as a link purposely for search engine results. Not all website crawlers use a search engine, therefore they only scan but do not index for search engine results - They may index for personal (i.e. links database) reasons though.



The User Agent: instruction normally expects a parameter that tells it which user agent to use. For example. The instruction User-Agent: GoogleBot, followed by the instruction Disallow: /MembersOnly/, would tell the website crawler called GoogleBot that when it reads the robots.txt text file it should not scan the content of the folder called MembersOnly and should not index its content (i.e. web pages) as links for search engine results. All other website crawlers would be allowed to scan and index the content of the MembersOnly folder.




Fig 1.4  Only disallow the GoogleBot website crawler from scanning and indexing MembersOnly

At this point you may be asking "What is the point of stopping one website crawler from scanning and indexing when the other website crawlers can do so?". Well provided that you do not have a private area on your website (i.e. a MembersOnly folder) one reason is because a certain website crawler might be scanning and indexing your website too frequently, innocently robbing your bandwidth in the process.

Remember. A website crawler also caches (makes a copy of) your web pages so the general public can view them when your web hosting provider's server (computer) is not working and therefore your website is not live (offline).

Another reason could be to stop a bad (malware) website crawler from scanning and indexing your website. Malware (Malicious Software) website crawlers scan and index your website looking to find private information and products (Membership Areas, Private Documents, Software You Sell and so on) in order to retrieve Passwords, Product Numbers, Database details and so on. In these cases you may want to stop all website crawlers by using the User-Agent: * and Disallow: / instructions together - A malware website crawler would ignore your robots.txt text file though!




Fig 1.5  Disallow all website crawlers from scanning and indexing your website's content

If you want to disallow all website crawlers from scanning and indexing a certain web page (i.e. a web page called news.htm inside the MembersOnly folder) you would use the User-Agent: * instruction followed by this Disallow: /MembersOnly/news.htm instruction.




Fig 1.6  Disallow all website crawlers from scanning and indexing the news.htm web page

To do the same thing but disallow only the GoogleBot web crawler you would use the User-Agent: GoogleBot instruction followed by the Disallow: /MembersOnly/news.htm instruction.




Fig 1.7  Only disallow the GoogleBot website crawler from scanning and indexing news.htm

If you want to add more than one folder to your disallow list simply put another instruction of Disallow: /FolderName/ into your robots.txt text file. For example. To disallow all website crawlers from scanning and indexing your cgi-bin folder, your images folder and your MembersOnly folder you would have the following robots.txt text file:




Fig 1.8  Disallow all website crawlers from scanning and indexing more than one folder

To specifically disallow GoogleBot from scanning and indexing your image files, where ever they are located in your public_html folder, you can use its image user agent called GoogleBot-Image instead.




Fig 1.9  Disallow the website crawler GoogleBot from scanning and indexing any image files

You can use Disallow: /images/ above, instead of Disallow: /, if you want to disallow your images folder only or you can stick to using User-agent: GoogleBot with Disallow: /images/. One reason for wanting to disallow images, a part from having a private/family photos you do not want the general public to see, is because of bandwidth theft.

Suppose you have a photo of a car on your website. Many websites might link to that car photo (i.e. http://www.yourwebsite.com/car.jpg), instead of displaying it directly from their own website's images folder, because they would not want people using their bandwidth if the car photo had to be downloaded from their website and/or because they do not have ownership of the car photo.



They would rather have people clicking on their CAR Link, linking to the car photo on your website, so that they are using your bandwidth and not theirs to display your car photo on their website from your images folder.

To allow a certain website crawler (i.e. GoogleBot) to scan and index your website's content but disallow all other website crawlers you would use the following instruction pairs. The empty line between the two pairs of instructions acts as a user agent separator. Meaning. The empty line allows you to build up a combination of user agent instructions.




Fig 1.10  Allow GoogleBot to scan and index your website's content but not other website crawlers

Although there are a couple of new instructions out there (namely ALLOW, SITEMAP and CRAWL-DELAY), as well as WildCards (i.e. the use of * and ?), this section has explained the main basics of the robotstxt instructions that would be needed by most website beginners and their website. However. If you wish to know more about robotstxt I would consider visiting the RobotsTxt website and this Search Tools website.




Bandwidth And Usage
Meta Tags Explained
INDEX