Tall Emu Pty Ltd requested a free test of the paid version of its popular product. The paid version of Online Armor Personal Firewall 220.127.116.11 scored 98% and took the lead back from Comodo. We have also tested a new version of ZoneAlarm Pro and the latest version of Filseclab Personal Firewall today.
Check Firewall Challenge pages for more information.
History and introduction
Firewall Challenge is a project that replaces our older project Windows Personal Firewall Analysis and its subproject Leak-testing. As a part of Window Personal Firewall Analysis project we have deeply analysed security products but we found out soon that such a testing was extremely time consuming.
It was not possible to test as many products as we wanted to.
On the other hand, Leak-testing seemed to be a very easy way how to test many products in reasonable time.
However, Leak-testing is not able to cover many of the important features of the desktop security products.
We have decided to combine the simplicity and effectivity of Leak-testing with the scope of our deeper analyses and created this project – Firewall Challenge.
This project examines personal firewalls, Internet security suites and other simila products for Windows OS that implement process-based security. We call all such products personal firewalls. In our opinion, personal firewalls should prevent spying and data and identity theft. So, we require personal firewalls to include host protection features too.
The list of personal firewalls we are aware of is available on the product list page. We know
that our terminology may be in conflict with the common understanding of what the firewalls are. To distinguish between personal firewalls and firewalls in the common sense, we call the later packet filters. A typical example of a packet filter is WIPFW. Most of the personal firewalls include a packet filter component. Simple packet filters are not worse than personal firewalls, they are just different kind of software – for different kind of users. This project does not examine stand-alone packet filters.
Methodology and rules
The tested firewalls are installed on Windows XP Service Pack 2 with Internet Explorer 6.0 set as the default browser.
The products are configured to their highest usable security settings and tested with this configuration only.
We define the highest security settings as settings that the user is able to set without advanced knowledge of the operating system. This means that the user, with the skills and knowledge we assume, is able to go through all forms of the graphic user interface of the product and enable or disable or choose among several therein given options, but is not able to think out names of devices, directories, files, registry entries etc. to add to some table of protected objects manually.
There are several testing levels in Firewall Challenge. Each level contains a selected set of tests and it also contains a score limit that is necessary to pass this level. All products are tested with the level 1 set of tests. Those products that reach the score limit of level 1 and thus pass this level will be tested in level 2 and so on until they reach the highest level or until they fail a limit of some level.
Most of the tests are part of Security Software Testing Suite, which is a set of small tests that are all available with source codes. Using this open suite makes the testing transparent as much as possible. For each test the tested firewall can get a score between 0% and 100%. Many of the tests can be simply passed or failed only and so the firewall can get 0% or 100% score only. A few tests have two different levels of failure, so there is a possibility to get 50% score from them. The rest of the tests have their specific scoring mapped between 0% and 100%. It should be noted that the testing programs are not perfect and in many cases they use methods, that are not reliable on 100%, to recognize whether the tested system passes or failed the test. This means that it might happen that the testing program reports that the tested system passed the test even if it failed, this is called a false positive result. The official result of the test is always set by an experienced human tester in order to filter false results. The opposite situations of false negative results should be rare but are also eliminated by the tester.
To be able to make right decisions in disputable situations, we define the test types.
Every test has some defined type. Tests of the same type always attempt to achieve the same goal.
Here is a list of the defined types and their goals:
- General bypassing test: These tests are designed to bypass the protection of the tested product generally,
they do not target a specific component or feature. This is why they attempt to perform various privileged actions
to verify that the protection was bypassed. These tests succeed if at least one of the privileged action succeeds.
Like the termination tests, general bypassing tests can not be used without modifying the configuration file.
- Leak-test: Leak-tests attempt to send data to the Internet server, this is called leaking.
Most of the leak-tests from Security Software Testing Suite
are configured to use a script on our website that logs leaks to our database by default.
For such tests, you can use My leaks page to see whether the test was able to
transmit the data. For leak-tests that do not use this script, we use a packet sniffer in unclear situations.
- Performance test: Performance tests measure impacts of using the tested product on the system performance.
The measured values provided by the tests on the system with the tested product installed are compared to the values
measured on the clean machine. Every software affects the system performance at least a little bit.
To give products a chance to score 100% in these tests, we usually define some level of tolerance here.
This means that if the performance is affected only a bit, the product may score 100%.
- Spying test: These tests attempt to spy on users' input or data. Keyloggers and packet sniffers
are typical examples of spying tests. Every piece of the data they obtain is searched for a pattern, which is defined in the
configuration file. These tests usually succeed if the given pattern has been found.
- Termination test: These tests attempt to terminate or
somehow damage processes, or their parts, of the tested product.
The termination test usually succeeds if at least one of the target
processes, or at least one of their parts, was terminated or damaged.
All the termination tests from our suite must be configured properly
using the configuration file before they can be used for tests.
- Other: Tests that do not fit any of the previously defined types are of this type. These tests, for example, may check
stability or reliability of the tested product.
For all tests in all levels that the product did not reach, the product's score is 0%. For all other tests the score is determined by the testing. The total score of the product is a sum of the scores of all tests divided by the number of all tests and rounded to a whole number.
It may happen that a new test is added to Firewall Challenge when some products already has their results.
In such case, the result for already tested product is set to N/A for this new test, which means that it is not counted for this product and does not affect its score or level passing. Neither the number of the tests, nor the number of levels is final.
We intend to create new tests in the future. We are also open to your ideas of new testing techniques or even complete tests.