URL categorization classifies websites based on the type of content – blogs, news, sports, adult, porn, violence, etc. zvelo’s URL categorization database is used to improve internet safety and security with solutions for malicious or phishing detection, web filtering and parental controls, brand safety, contextual targeting, subscriber analytics, and more.
As a web content categorization company, we are intensely focused on the trends in the types of content being published on the web, how this content is accessed, used and shared, who is publishing the content, and hundreds of other details that goes into our efforts to provide the market’s best web categorization services.
Over many years or testing, trial and error, zvelo ultimately determined that a human-machine “hybrid” approach to classification produced the best outcomes. The Human element provided the verifications necessary for the highest levels of accuracy, while machines (ie. AI/ML models and calculations) provided the scaling necessary to deal with the incredible volumes of new URLs and content being published at an increasing rate.
The URL checker found on the zvelo.com homepage, previously known as the “Test-a-site” tool, serves to demo various contextual categorizations about URLs that can be derived by licensing zvelo contextual categorization and malicious website detection services. When queried, the URL checker yields a sample of data sets stored within the zvelo URL database, via…
If one performs the search “use www or not,” well over a billion results in many of the most popular search engines are returned. The focus of each result may differ. For zvelo, the usage is irrelevant because its contextual categorization processes are designed to identify and handle each component of a URL. At a simplistic view, the basic components of a URL are the following:
Anatomy of a Dynamic Website Of the hundreds of billions of URL queries zvelo has received for website categorization in 2013, an estimated 27% have been classified as being dynamic (see image 1). Dynamic categories in this data sample included Social Networking, News, Search Engines, Personal Pages & Blogs, Community Forums, Technology (General), and Chat.…