Web Server Log Analyzer
Web Server Log Analyzer
ABSTRACT
Large amount of useful operations data and warnings of
failures are available in the server logs. The challenge is that such
information gets increases with time, as a result of it number of entries
in the logs, which can quickly grow to unmanageable size. Automating
the analysis of server logs is essential to allow using the logs as a
proactive administrator tool. Log analyzer is 3-tier architecture based
software that parses through the log files generated by whichever web
server follows the standard web server logging standards. Analyze
parsed data and categorize them into meaningful reports that can be
read by the user for administration or monitoring purpose. A software
application designed to parse a log file, which typically contains raw
collected data, and convert it to an easy-to-read and understand form.
1. Specification Acceptance:
It may happen that any organization has more than one
server for its working. So it is important to analyze the logs of all the
servers contained in it. In our system we are going to take the logs
from all the server machines of an organization. So that it will show the
graphical or tabular data of organization as single server.
In our project we are firstly accepting the server names
long with the required specification that the administrator wants on
the home page. We are going to analyze the log files of the server
specified by the application user that is admin of server. The analysis
result we are going to show as per the specification given by the user
on home page. The application user also going to choose whether the
representation of analysis will be in graphical format or tabular format
or any other format and also if he requires the actions over the error
logs occurred.
Middleware:
This is a dedicated server pc which is used to handle and control all
backend processes. It contains the database which stores the parsed
logs in a table. It takes the input from the agent programs and parse
them and store them. Also it takes the request from the front end to
search and calculate the data for representation. It will be a system
service which continuously runs in background.
Presenter:
It represent the data in required graphical or tabular form. In this
system presenter be a java server pages which controls the users
activity. It accept the parameters to analyze the data and send request
to middleware. Then middleware search and calculate values and send
it to presenter. By using the ready java classes the data can be
represented in a proper graphical form.
Communication
In this three tier architecture important thing is
connectivity and communication between three components.
Remoting service can be used for this communication
STRUCTURAL FLOW OF THE SYSTEM
The overall flow of the system goes through these stages as shown in
figure
CONCLUSION
Web logs provide vast amounts of information about the
use of their websites. Web logs can provide information about which
user groups access the website, which resources are viewed the most,
and the links users follow to arrive at the site. In order to process the
large amounts of data generated by most websites, log analysis
software must be utilized. This software produces reports with
summary statistics about website. Our program analyzes the
information in server's log files and creates detailed reports. It can tell
when server was down and for how long, how much bandwidth site
has been using. Also this system has main advantages of accessing logs
of all servers of an organization at one time so admin has a great
advantage of it.
REFERENCES
[1]Dilip sisodia and shrish warma ”Web Usage Pattern Analysis
Through Web Logs: A Review”2012 ninth international conference on
computer science and engineering.
[2]Nathaphon Kiatwonghong and Songrit Maneewongvatana “Intel
LogLog: A Real-time Log Analyzer” 201O 2nd International Conference
on Education Technology and computer (ICETC)
[3]Wichian Premchaiswadi“ Extracting WebLog of Siam University for
Learning User Behavior on map reduce” 4th International Conference
on Intelligent and Advanced Systems (ICIAS2012)
[4] L. Liberti “Log Analysis Software Architecture”
[5]Chen Hu, Xuli Zong, Chung-wei Lee and Jyh-haw Yeh, “World Wide
Web Usage Mining Systems and Technologies”, Systemic, Cybernetics
and Informatics Vol. 1 – Number 4.
[6]The Apache Software Foundation, “Log files,” http:// httpd
apache.org/ docs/ 1.3/ logs.html, 2010.
[7]Yuan, F., L.-J. Wang, et al. (2003). Study on Data Preprocessing
Algorithm in Web Log Mining. Proceedings of the Second International
Conference on Machine Learning and Cybernetics, Wan, 2-5 November
2003.
[8]Thomas Reidemeister, Miao Jiang and Paul A.S. Ward (2011).
Mining Unstructured Log Files for Recurrent Fault Diagnosis 12th
IFIP/IEEE 1M 2011: Mini Conference 77