Georgia Tech Honeynet Report (April 1, 2004 -- September 15, 2004)

 

1.0  HONEYNET DEPLOYMENTS

=========================

   1.1 Current technologies deployed.

 

We are running a GEN II Honeynet with a variety of OSs of interest.  We continue to use live OSs instead of VMware or HoneyD.  Our web page with a diagram of our current setup is located at:  http://users.ece.gatech.edu/~owen/Research/HoneyNet/HoneyNet_home.htm.  Since our last report, we have restructured our honeynet and converted to the Honeywall CD configuration; we also now conduct all monitoring of the honeynet on an analysis box that is separate from the Honeywall (bridge) machine.  We continue to deploy a Darknet within our Honeynet and have added a SUN workstation during this period.  Our focus continues to be the use of the Honeynet to help secure the campus network. 

 

   1.2 Lessons learned from the technology, what you like about it.

 

Honeynets can be incorporated into an organization’s network security plan to help secure the network. We work closely with Georgia Tech’s Office of Information Technology to help secure the campus network.

                We have improved our log file naming convention.  Previously we created a new folder for each day’s log file(s) and named the folder by the month and day (i.e. Mar15, Apr24, Aug12).  This format required separate storage locations for different years, and was more difficult to navigate both within Ethereal and in developing scripts that run on a series of logs because the folders were sorted alphabetically by month instead of in date order.  Our improved format is to create a folder for each day in the numerical form of <yearmonthday> (i.e. 20020315, 20030424, 20040812).

                Our new setup with the honeywall CD, separate analysis box, and cleanly wired honeynet rack appears to be stable and makes it much simpler to adjust the configuration of the honeynet; adding a honeypot to the network or quickly disconnecting a honeypot from the network is now a simpler task as a result of the improvements made this summer.

                The data consistency created by using the honeywall CD is very beneficial.

 

   1.3 Lessons learned from the technology, what is lacking, what you

would like to see improved.

 

                We need better methods to analyze the large amount of data collected by the honeynet.  We are working towards developing tools to both analyze and visualize the data collected.

 

2.0 FINDINGS

=============

 

    2.1 Number and type of systems compromised during six month period.

 

We have had two hoenypots compromised by worms since our last report.  One machine was a Microsoft Windows 2000 system and the other was a Linux RedHat 7.3 system.

 

    2.2 Highlight any unique findings, attacks, tools, or methods.

 

                We found it interesting to observe the large spike in traffic to the honeynet from the Georgia Tech address range as students moved back in to start the Fall semester.  Over the course of the summer, a typical day would include less than five (and often zero) IP’s attempting to establish connections to the honeynet from within Georgia Tech’s two and a half Class B address range.  As students returned for the Fall semester, the number of IP’s attempting to establish connections increased dramatically, with a typical day including 10-20 separate IP’s attempting to establish connections.

                Using a darknet to increase our IP address range provides additional traffic to include in our analysis (without the addition of physical honeypots) and helps us to observe scanning patterns.

We had 489 unique machines on the Georgia Tech campus that attempted to connect to the Honeynet between January 1st, 2004 and August 31st, 2004.  (These machines are assumed to be compromised or in use by a malicious person.)

 

    2.3 Any trends seen in the past six months;

 

                                The majority of traffic to our honeynet originating from within the campus network attempts to establish connections to Port 445.  Between January 1, 2004 and August 31, 2004, 795 machines from within the campus attempted to establish a connection using port 445.  (This number includes repeat attacks from a single IP address, but only counts one connection attempt per machine per day to port 445.)

                We found an article that mentions the use of TCP ports 135 and 1026 to send popup spam to Windows systems.  On at least one occasion we observed this type of traffic, but it did not result in a popup window on our honeypot.

 

    2.4 Document data analysis tools and methods being used.

 

                We currently use ethereal with various filters as our primary data analysis tool.  We are developing perl scripts to parse through the logs and generate plots of the data.  (PCAP data is parsed and the extracted results are stored in an XML file.  Plots are then generated from the XML data.)

 

    2.5 For data analysis what tools work well, and what still needs to 

be developed.

 

                Ethereal is not a very efficient or effective way to analyze daily logs, especially as the size of the honeynet is increased, resulting in additional traffic.  We have begun to develop perl scripts, but this method appears to be too slow as well.  What we need to do is implement our perl scripts in C.  We also are considering the use of an SQL database to be able to cope with the large amount of data and access it more quickly.

                We could reap huge benefits from a data analysis CD in addition to the honeywall CD.  The data analysis CD could be used to setup your

analysis box and could be another way to standardize honeynets throughout the alliance.

 

3.0 MISC ACTIVITIES

====================

3.1  Presenting at conferences

 

The papers listed in 3.3 were presented at conferences.

 

3.2  Developing, testing or releasing code

 

                We have developed several perl scripts for data analysis, but we believe implementing these (and other) scripts in C will prove more beneficial for data analysis.  We also have a PhD student working on network security visualization techniques and are using the software he is developing to observe our honeynet data.

 

3.3  Publication of papers

 

                The following papers were published and presented at the IEEE Information Assurance Workshop at West Point, New York:

"Application of a Methodology to Characterize Rootkits Retrieved from Honeynets" by John Levine, Julian Grizzard, and Henry Owen; and

"An Investigation of a Compromised Host on a Honeynet Being Used to Increase the Security of a Large Enterprise Network" by Timothy Jackson, John Levine, Julian Grizzard, and Henry Owen.

 

The following paper was published an presented at DEFCON 12:

“Network Attack Visualization” by Greg Conti.

 

The following paper was published and presented at the Recent Advances in Intrusion Detection (RAID) Symposium: 

"HoneyStat: Local Worm Detection Using Honeypots" by David Dagon, Xinzhou Qin, Guofei Gu, Wenke Lee, Julian Grizzard, John Levine, and Henry Owen.

 

3.4  Involvement in SotM challenges.

 

                We have not participated.

 

3.5  Other

 

                We have developed a honeynet continuity file for the Georgia Tech Honeynet.  One of the challenges of a student run honeynet at an academic institute is that students (graduate and undergraduate) arrive and depart on a regular basis.  For example, John Levine, who created the Georgia Tech Honeynet, graduated with a PhD in May 2004 and is now an instructor at West Point, and Julian Grizzard, who is currently in charge of the honeynet, expects to graduate with his PhD in 2005.  We developed the continuity file to streamline the process of teaching new people how to run and monitor the honeynet, as well as serve as a source for lessons learned.  The file includes configuration information, points of contact (internal and external), web sites of interest, logging information, and policy guidelines for monitoring our honeynet.  We recommend a continuity file for any organization, but especially for academic institutes.

 

4.0 ORGANIZATIONAL

==================

 

4.1  Changes in your structure of your organization.        

 

We have new undergraduate students getting involved in the honeynet, to include Neil Joshi, and Alfredo Ramos.

 

5.0 LESSONS LEARNED

===================

 

5.1  What positive things can you share with the community, so they can replicate your success.

 

                The Georgia Tech Honeynet is a great tool for helping to secure the campus network.  Since all traffic to the Honeynet is suspicious, any packet to the Honeynet originating from within the Georgia Tech address range is likely from a compromised computer, a malicious user, or the campus IDS.  We send reports of all computers attempting to connect to the Honeynet to the campus network managers (OIT); they can then take action to keep the network secure by correlating our data with their IDS tools in order to reduce false positives.

                The honeywall CDROM has worked out well.  When making significant changes to a honeynet, we recommend making the changes in a down

period in case there are configuration issues.  Making the transition to the honeywall CDROM during the summer semester proved very beneficial.  We also recommend testing any configuration changes on a private network prior to connecting to the internet.

 

5.2  What mistakes can you share with the community, so they don't make the same mistakes.

 

Parsing through large data sets can be very time consuming.  We need better tools that the community can use to make this analysis easier and more efficient.

 

6.0 GOALS

=========

 

6.1 Plans/Goals for next six months.

 

                We are currently working on writing a Know Your Enemy Paper based on a statistical analysis of our data.  We also intend to add honeypots to our honeynet within the next few months, with a long term goal of developing a distributed honeynet across the Georgia Tech network.  We also plan to develop more elaborate quality analysis tools (in a beta state right now) to share with the community.