Data collection and web data mining are critical processes for many companies and the marketing companies today. The techniques usually used include search engines,
topic-based searches and directories. Web data mining is necessary for any business that wants to create data warehouses by harvesting data from the internet. This is so
because high-quality and intelligent information may not be harvested from the internet easily. Such information is critical as it enables you to get desired results and the
business intelligence in demand.
Keyword-based searches are important in marketing of company products. They are usually affected by the following factors:
• Irrelevant pages. The use of common and general keywords on the search engines yields millions of web pages. Some of thesepages may be irrelevant and may not be of help
to the user.
• Ambiguous results.This is usually caused by multi-variant or similar keyword semantics. A name would be an animal, movie or even a sport accessory. This results in web
pages that are different what you are actually searching for.
• Possibility of missing some web pages.There is a great possibility of missing the most relevant information that is contained on web pages that are not indexed on a given
keyword.
One of the factors that prohibit the usage of web data mining is the effectiveness of search engine crawlers. This is widely evidenced by lack of access of the entire web due to
search engine crawlers and bot.This can be attributed partly tobandwidth limitations. It is important to understand that there are thousands of databases on the internet that can
deliver well-maintained information, high quality and are not easily accessed by crawlers.
In web data mining it is important to understand that majority of search engines have limited choices or alternatives for keyword query combination. For instance, yahoo and
Google offer option like phrase and even the exact matches that may limit even the search results. It is usually demands more efforts and even time and thereby get the most
important and relevant information.The human behavior and the alternatives usually change of time.This therefore implies that web pages need to be updated frequently and
there by reflect on the emerging trends. It is important to realize that there is a limited space for web data mining. This is so because the information that currently exists is
heavily relied on keyword-based indices. This does not apply for the real data.
It is important to realize that web data mining is an important tool for any business. It is therefore important to embrace this technology to solve data crisis problems. There are
several limitations and many challenges which may have resulted in the quest of effectively and efficiently in rediscovering the use of web resources. However, irrespective of the
challenges of web data mining, this technology is an effective tool that can be employed in many technological and scientific fields. It is therefore paramount to embrace this
technology and use it fully in order to realize your corporate goals.
Source: http://goarticles.com/article/Challenges-in-Effective-Web-Data-Mining/6771744/
topic-based searches and directories. Web data mining is necessary for any business that wants to create data warehouses by harvesting data from the internet. This is so
because high-quality and intelligent information may not be harvested from the internet easily. Such information is critical as it enables you to get desired results and the
business intelligence in demand.
Keyword-based searches are important in marketing of company products. They are usually affected by the following factors:
• Irrelevant pages. The use of common and general keywords on the search engines yields millions of web pages. Some of thesepages may be irrelevant and may not be of help
to the user.
• Ambiguous results.This is usually caused by multi-variant or similar keyword semantics. A name would be an animal, movie or even a sport accessory. This results in web
pages that are different what you are actually searching for.
• Possibility of missing some web pages.There is a great possibility of missing the most relevant information that is contained on web pages that are not indexed on a given
keyword.
One of the factors that prohibit the usage of web data mining is the effectiveness of search engine crawlers. This is widely evidenced by lack of access of the entire web due to
search engine crawlers and bot.This can be attributed partly tobandwidth limitations. It is important to understand that there are thousands of databases on the internet that can
deliver well-maintained information, high quality and are not easily accessed by crawlers.
In web data mining it is important to understand that majority of search engines have limited choices or alternatives for keyword query combination. For instance, yahoo and
Google offer option like phrase and even the exact matches that may limit even the search results. It is usually demands more efforts and even time and thereby get the most
important and relevant information.The human behavior and the alternatives usually change of time.This therefore implies that web pages need to be updated frequently and
there by reflect on the emerging trends. It is important to realize that there is a limited space for web data mining. This is so because the information that currently exists is
heavily relied on keyword-based indices. This does not apply for the real data.
It is important to realize that web data mining is an important tool for any business. It is therefore important to embrace this technology to solve data crisis problems. There are
several limitations and many challenges which may have resulted in the quest of effectively and efficiently in rediscovering the use of web resources. However, irrespective of the
challenges of web data mining, this technology is an effective tool that can be employed in many technological and scientific fields. It is therefore paramount to embrace this
technology and use it fully in order to realize your corporate goals.
Source: http://goarticles.com/article/Challenges-in-Effective-Web-Data-Mining/6771744/
No comments:
Post a Comment