Usage Analyst
Usage Analyst imports usage logs to a SQL Server database. It supports 25 log file formats and provides many canned reports of Web activity in summary and detail form (e.g., the most popular pages, bandwidth utilization, and geographic breakdown of visitors and organizations). You can easily customize these reports to match your Web site construction or to satisfy your curiosity.
I wanted to use a real-world example, so I asked Windows NT Magazine's Webmaster for some usage logs of the magazine's Web site. I received a 17MB log file of visits to the magazine's forums and a 204MB log of visits to the magazine's main Web site.
Defining a log file, server, and site. The first time you start Usage Analyst, the Server Manager prompts you to define a log data source, servers in the log file, and Web sites to analyze. When you define the log data source, you must identify the log file format (e.g., IIS, IIS extended, Apache). To analyze a log format on multiple HTTP servers, enter the servers' domain names to define the servers under one log data source. Each server can host multiple Web sites, so under the server icon you need to define each site you want to analyze. Usage Analyst identifies sites by URL. The Server Manager is a powerful tool for companies that host numerous subscriber Web sites on one server.
For instance, suppose you have three Web sites on one server running IIS and you want to analyze each of them. All the log data is in one IIS log file, so you define one log data source (e.g., d:\logs\webs.log), one server (e.g., winntmag.com), and the Web sites to analyze (e.g., http://www.winntmag.com) in Server Manager. Screen 4 shows Server Manager's description for the Windows NT Magazine log, with details of two imported log files.
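The relationship among a log data source, its servers, and their sites can be pictured as a small nested data structure. The Python sketch below is only an illustration of that hierarchy, not Usage Analyst's internal model; the class and function names are hypothetical, and the values come from the example above.

```python
from dataclasses import dataclass, field


@dataclass
class Site:
    url: str  # sites are identified by URL


@dataclass
class Server:
    domain: str
    sites: list = field(default_factory=list)  # Site objects hosted here


@dataclass
class LogDataSource:
    path: str
    log_format: str
    servers: list = field(default_factory=list)  # Server objects in the log


def sites_for_import(source):
    """Return the site URLs defined under a log data source."""
    return [site.url for server in source.servers for site in server.sites]


# One log data source, one server, and one of the sites to analyze.
source = LogDataSource(
    path=r"d:\logs\webs.log",
    log_format="IIS",
    servers=[
        Server(domain="winntmag.com", sites=[Site("http://www.winntmag.com")]),
    ],
)

print(sites_for_import(source))
```

Additional sites on the same server would simply be further Site entries under the same Server.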
To analyze a log file, start the Import Module and enter the usage log's filename. The Import Module prompts you for the sites you want to analyze. You must choose from the sites you defined in Server Manager. Click the green right-arrow button on the toolbar to start the import operation.
In my test, the 17MB log of 1 week's visits to the Windows NT Magazine forums took 7 minutes and 32 seconds to import. The 204MB log of 1 week's visits to the main Web site took 31 minutes and 36 seconds to import. Site Server's sample log file was hardly a reflection of the real world. It was only 102KB and imported in 6 seconds.
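To compare these timings, it helps to convert them to rough import rates. The Python sketch below only does the arithmetic on the three measurements quoted here; the function name is illustrative, and treating 1MB as 1024KB for the sample log is an assumption.

```python
def import_rate_mb_per_min(size_mb, minutes, seconds):
    """Return the import rate in megabytes per minute."""
    elapsed_minutes = minutes + seconds / 60
    return size_mb / elapsed_minutes


# Measurements from the test described above.
forums_rate = import_rate_mb_per_min(17, 7, 32)
main_site_rate = import_rate_mb_per_min(204, 31, 36)
sample_rate = import_rate_mb_per_min(102 / 1024, 0, 6)  # 102KB, assuming 1MB = 1024KB

print(f"Forums log:    {forums_rate:.2f} MB/min")
print(f"Main site log: {main_site_rate:.2f} MB/min")
print(f"Sample log:    {sample_rate:.2f} MB/min")
```

On these figures the 204MB log imported at a higher per-megabyte rate than the 17MB log, so an estimate scaled from one file may not carry over to another.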
After you import the log file into the SQL Server database, you can generate usage reports. If you want reports based on page titles, domain names, and geographic summaries, run the following options on the Tools menu before you start the Analysis Module: Lookup HTML Titles, Resolve IP Addresses, and WhoIs Organizations. You need access to an Internet-based Domain Name System (DNS) server to resolve IP addresses and run WhoIs queries.
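As a rough picture of what resolving IP addresses involves, the Python sketch below performs a reverse DNS lookup for each distinct address and caches the result. It is not Usage Analyst's implementation; the function name, the injectable lookup parameter, and the sample addresses and hostname are assumptions made for illustration. The default lookup uses the standard library's socket.gethostbyaddr, which requires a reachable DNS server.

```python
import socket


def resolve_addresses(ips, lookup=None):
    """Map each distinct IP address to a hostname, or None if it won't resolve."""
    if lookup is None:
        # gethostbyaddr returns (hostname, aliases, addresses); keep the hostname.
        lookup = lambda ip: socket.gethostbyaddr(ip)[0]
    cache = {}
    for ip in ips:
        if ip in cache:
            continue  # each address is looked up only once
        try:
            cache[ip] = lookup(ip)
        except OSError:  # covers socket.herror and socket.gaierror
            cache[ip] = None
    return cache


# Stub resolver so the example runs without network access (hypothetical data).
def stub_lookup(ip):
    known = {"192.0.2.1": "host1.example.com"}
    if ip not in known:
        raise socket.herror(1, "Unknown host")
    return known[ip]


resolved = resolve_addresses(["192.0.2.1", "192.0.2.9", "192.0.2.1"], stub_lookup)
print(resolved)
```

Caching matters because a busy log repeats the same visitor addresses many times, and each network lookup is slow compared with reading a log line.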
Reporting on Web visitors. The Analysis Module's built-in catalog of reports works well for beginners and reviewers. Click File, Open Analysis Catalog, and expand each category to see options for Advertising, Auditing, Detail, and Summary reports. When you highlight a report, an explanation of its contents appears in another window.
To start a report, select Executive Summary in the Summary category, and click Next. From the usage log, select the Web site you want to analyze, click Next again, and select the report time period from the options you see on the screen. The final dialog box asks if you want to restrict the report to a portion of the Web site by path name, domain name, or country. Clicking Finish pops up a screen that shows the report fields and details. Right-clicking a field lets you edit the field, filter data values, add a description, and set the font size of the headers. Screen 5 shows the fields in four built-in reports: Executive Summary, Hit Detail, Geography Detail, and Advertisement Detail.
To finalize the report, specify the name and output format. For report output, you can choose HTML, Microsoft Word, Microsoft Excel, or fixed-width text. Click the green right-arrow button on the toolbar to create the report. When the report is complete, Site Server automatically opens Word, Excel, or a browser window to display the results.
After I imported my 204MB log file, the Analysis Module took slightly longer than 11 minutes to generate the Executive Summary report. Thus, if your monthly log file is larger than 1GB, I recommend that you import data and generate reports during off-hours. The scheduler utility in the Tools menu automates import and analysis jobs with an interface similar to NT's command scheduler, winat.exe.
The last page of each report includes definitions of terms in the report. You need to thoroughly review these terms to fully understand the data in the report. For example, Site Server records a hit whenever someone touches a page. A hit is any type of connection to an Internet site. The number of hits is relatively meaningless information, because it includes accidental visits and errors. Site Server records a request when the site responds to a user's query for information. The number of requests is useful information, because it includes only visitors searching for content. Site Server records a visit when a user makes consecutive requests from a Web site.
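The difference among hits, requests, and visits is easier to see on a few sample log records. The Python sketch below is a simplified illustration, not Site Server's actual rules. It assumes that a request is a successful (status 200) hit on a content page, that content pages end in .htm, .html, or a slash, and that a gap of more than 30 minutes between a visitor's requests starts a new visit. All of those thresholds, the function name, and the sample records are hypothetical.

```python
from datetime import datetime, timedelta

VISIT_TIMEOUT = timedelta(minutes=30)      # assumption: gap that ends a visit
CONTENT_SUFFIXES = (".htm", ".html", "/")  # assumption: what counts as content


def summarize(records):
    """Count hits, requests, and visits in (visitor, time, status, path) records."""
    hits = len(records)  # every logged connection, including errors
    requests = [
        r for r in records
        if r[2] == 200 and r[3].endswith(CONTENT_SUFFIXES)
    ]
    visits = 0
    last_seen = {}
    for visitor, when, _status, _path in sorted(requests, key=lambda r: r[1]):
        previous = last_seen.get(visitor)
        if previous is None or when - previous > VISIT_TIMEOUT:
            visits += 1  # first request, or first after a long gap
        last_seen[visitor] = when
    return {"hits": hits, "requests": len(requests), "visits": visits}


day = datetime(1997, 12, 3)
records = [
    ("192.0.2.10", day.replace(hour=9, minute=0), 200, "/index.html"),
    ("192.0.2.10", day.replace(hour=9, minute=0, second=1), 200, "/logo.gif"),
    ("192.0.2.10", day.replace(hour=9, minute=5), 200, "/forums/default.htm"),
    ("192.0.2.10", day.replace(hour=9, minute=6), 404, "/missing.html"),
    ("192.0.2.20", day.replace(hour=9, minute=10), 200, "/"),
    ("192.0.2.10", day.replace(hour=11, minute=0), 200, "/index.html"),
]

print(summarize(records))  # {'hits': 6, 'requests': 4, 'visits': 3}
```

In this sample the image and the failed page count as hits but not as requests, and the first visitor's return at 11:00 counts as a second visit from that visitor.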
I recommend creating usage reports in Word format. The HTML version creates graphs that are too large to view easily, and it produces an empty weekly trend graph if you analyze only 1 week's data. If you plan to use the HTML report frequently, you'll want to create a version that omits the weekly trend summary.
My test logs for the Windows NT Magazine Web site were from December 3 to December 13, 1997. The Executive Summary and Hit Detail reports produced the following usage statistics:
- Daily Web hits average 6900, evenly spread between work and nonwork hours.
- The number of daily visits ranges from 3000 to 8000.
- The number of daily visitors varies from 2500 to 6200.
- Requests for content pages total more than 1 million per week.
- Visitors request an average of 27 pages.
- Visitors browse the Web site 24 hours a day; daytime traffic is twice as high as nighttime traffic.
- Visitors spend an average of 7 minutes on the Web site.
Another interesting summary ranked the top 20 organizations to visit the site. Analysis revealed that Microsoft is the number one visitor to the magazine's Web site, followed by UU.NET, America Online (AOL), and CompuServe. Windows NT Magazine employees and associates rank eleventh, and a large percentage of visitors are employees of Digital Equipment, IBM, HP, MindSpring, and Boeing.
The usage by country summary report showed that visitors from the US account for 83 percent of visits. Visitors from the UK, Australia, and Canada represent 3 percent each. The report showed visitors from all over the world, including Austria, Indonesia, South Korea, Malaysia, Thailand, South Africa, Japan, Brazil, Israel, New Zealand, Taiwan, and Poland. International visits explain why the Web site has activity 24 hours a day.
But Wait--There's More!
Site Server is a complex product with a steep learning curve and numerous customization features. You need to use the software repeatedly to fully understand its options. After using the software for about 60 hours, I grasped the basics and some of Site Analyst's and Usage Analyst's more sophisticated features.
I didn't fully explore Site Analyst or Usage Analyst, but I noted some needed improvements for the next version.
The cyberbolic view for large maps needs some fine-tuning. You need a delicate touch to properly line up the dynamic view to expose objects that are many levels deep. Viewing multiple links is difficult, even on a 17" monitor after tweaking the node distance to fit information on the screen. When the node distance is close, the labels overlap and are unreadable. Increasing the node distance reduces the number of objects Site Server displays. The software doesn't display flyover labels long enough for you to read them, especially when an object is many levels deep in the hierarchy.
Site Server also needs an option to concurrently map multiple sites or import multiple log files. Currently, a site map or log import ties up the window until the operation completes. Importing my 204MB log file tied up the screen for 31 minutes.
Another problem is that when you delete a site from Server Manager in the Import Module, you can still select the site in the Analysis Module. You obviously can't analyze a site you've deleted, so you shouldn't be able to select it.
Finally, I couldn't figure out how to report on each visitor's unique address. I suspect Site Server offers this feature, but I didn't spend a lot of time digging for it.
Overall, Site Server is a useful product that helps you maintain good onsite and offsite links. In the cyberbolic view, the software automatically updates the link dialog box as you move from object to object. This feature saves you time because you don't have to continually open new dialog boxes. Different sites typically want different information, and Site Server's canned reports give you a convenient template for generating new reports. If you want nonstandard usage information or simply like treasure hunting on Web sites, you'll want to consider Site Server.