Alex Elliott
The internet home of a prospective software engineer
This is my personal blog where I discuss projects that I'm currently working on, work I've recently completed, or write about any topic which has caught my interest in the world of Computing from my studies or from my personal research.
Latest Articles
My Little FAQ For PHP Pitfalls
November 6th, 2009
There are a few questions which come up time and time again in Zymic’s support channels (Zymic IRC, or the forums) and having answered them several times, I feel I would like to spend some time writing up a response which I hope can make handling these support queries a bit easier. So, ok, firstly:
PHP Notices
Since Zymic’s PHP is set to display errors of the level E_NOTICE, users often find they are getting errors on Zymic that they are not getting on their local test server or previous host. E_NOTICE level errors are good practice recommendations that point out where you’ve written valid but improper PHP. It is recommended that rather than suppress E_NOTICE level warnings you display them during the development process and fix your code so that the errors are never generated. This improves the quality and reliability of your code.
If your local test server does not display notices then I would recommend you change your local PHP configuration to start generating them. You can do this by modifying the “error_reporting” directive in your php.ini file. In a system where notices are disabled it is typically set up as this:
error_reporting = E_ALL & ~E_NOTICE
This format takes E_ALL, a group of many error levels including notices, and then removes notices from the list by adding ~E_NOTICE (not E_NOTICE). So you can enable E_NOTICE level errors by making it simply:
error_reporting = E_ALL
The next few topics are about some specific E_NOTICE level errors you may have found and what they mean then how to resolve them.
PHP Notice: Undefined Index or Undefined Variable
This notice refers to using a variable or a member of an array that has not been defined prior to its use. The simplest example of this is to take this code:
1 2 3 | <?php echo $foo; // At no point have we defined $foo, so using it is probably bad ?> |
This will produce an error along the lines of “PHP Notice: Undefined variable: foo in __FILE__ on __LINE__”. This is obviously a very trivial example and it rarely comes that simple in real code, but it is an illustration of what triggers this error. Attempting to reference some data that doesn’t exist can easily lead to unexpected behaviour, and it’s fairly obvious that all variables/array members you try to read from should be defined and initialised before you try that read.
A common context where “Undefined Index” will turn up is in people handling the superglobals like $_POST. If you attempt to use $_POST['foo'] without checking some input with the name “foo” has been posted to the script will trigger this notice. So, that’s what the notice is and what causes it, but how can you prevent it from appearing? Well, there are some tests that PHP has which can check the data, and then you can make sure that you’re only attempting to read the data if the variable/array member is set or is not empty. So say you want something that outputs “Hello, {name}” to the browser and it works on GET data, an implementation which will trigger this notice might be:
1 2 3 | <?php echo 'Hello, ', htmlspecialchars($_GET['name'], ENT_QUOTES); ?> |
But this script doesn’t work particularly well when a name isn’t provided, you will get a notice and the output will just be “Hello, “, which is not particularly meaningful – we can do better than that. So if we add in a check using PHP’s empty() function to see if there is some data provided we could instead write:
1 2 3 4 5 6 7 | <?php echo 'Hello, '; if(empty($_GET['name'])) echo 'Stranger'; else echo htmlspecialchars($_GET['name'], ENT_QUOTES); ?> |
This is very similar to the last, but it will not trigger a notice if there is no name provided to the script, and in fact handles it by instead outputting “Hello, Stranger” if no name is provided. This is a bit more elegant, and there are many ways you could tackle this. You could have the case where if no input is provided we show a form instead for the user to provide a name.
PHP Notice: Use of undefined constant
This one is very much similar to the last one, but is worth mentioning separately because there’s enough to be said about the mistakes that lead to it, and how you can make these mistakes and not realise it without the benefit of notices. This notice is about the use of undefined constants in code, the interesting thing is how PHP handles a case where an undefined constant is found. It will trigger this notice, but since this is valid PHP it will take a value for the constant for the script to use, and that is the identifier (or name) of the constant, thus an undefined constant foo will evaluate to the string ‘foo’.
Because of this fact you often see cases where an array member is referenced via $array[foo] rather than $array['foo']. If notices are disabled, then it is possible that the author of that code will not notice the mistake (though syntax highlighting in your editor should mitigate this somewhat) because while foo is an undefined constant the two are equivalent. However, this is not always the case. An easy example to demonstrate this is this. Take file1.php to be this:
1 2 3 4 5 | <?php $array = array( 'foo' => 'Hello there', 'bar' => 'Goodbye'); echo $array[foo]; // Note here we've not quoted foo so it's a constant, // and in the context of just this script it's undefined. ?> |
When you visit this page you will find it triggers this notice and outputs “Hello there”, however if we were to also have a file2.php containing this:
1 2 3 4 5 | <?php define('foo','bar'); // define the constant foo with the value "bar" require 'file1.php'; // and then bring in the file we just wrote, // with this constant in scope ?> |
When you visit file2.php you’ll find the notice has gone and we’re now presented with “Goodbye”, this is the crux of the issue, when you unintentionally use an undefined constant you no longer have any idea how that script will run when it is part of a larger program. What it will actually do is now undefined behaviour. So, if what you mean to write is a string, make sure it has quotes around it, so the PHP parser knows that it’s a string – not a constant.
That’s it… for now
I may add to this at a later date to add in anything else that gets asked frequently, we shall see.
Musings on Syntax Highlighting for Websites
February 6th, 2009
Syntax highlighting can be very important to some websites, particularly those featuring articles on programming practice/theory or pastebin/nopaste websites for collaborative debugging. However, most highlighting packages tend to use pattern matching to attempt to correctly highlight a given document rather than a more accurate but more complex lexical parser and do not have the capacity to use multiple highlighting schemes for a given document internally. If you choose to highlight something as PHP, then only the PHP segments will be syntax highlighted, when it’s plausible that there will also be HTML, XML, Javascript, CSS, etc in the same document.
These are two features lacking from most existing syntax highlighting packages today, and ones that I think would be extremely useful to have in publicly available free software tools. The question is simply whether it is feasible to include them, or whether what we’ve got currently is as good as it’s likely to get.
Pattern Matching versus Lexical Parsing
These are the two main ways of taking a source document and producing a highlighting for it. Pattern matching uses regular expressions to attempt to catch recognisable patterns in the given language which is simpler to produce, but does not guarantee good results. Lexical parsing on the other hand is a much more complex but when done more flexible method for producing a highlighting of some input.
Lexical parsing involves going through the input from start to finish breaking the input up into “tokens”, which are small segments of the input with some associated meta-data. In essence it breaks the code provided down into it’s components: strings, keywords, variables, etc. The power of this model is that while the parser is working it can use its state information on things like scope and context to provide more accurate and more informative details. In fact, a full lexical parser would be able to identify syntax errors and highlight them automatically.
As to providing more information, with tokenised input it would be fairly trivial to note which braces/brackets match one another, and unlike a pattern matching system you can include information from other parts of the program – take the simple example of a C++ typedef, something simple like “typedef vector<string>::iterator vec_iter”, which provides a new shorthand type “vec_iter” as a vector<string> iterator. While a pattern matching model could probably work out that vec_iter was a type, it would not know what it represented, or if it was valid. A lexical parser would be able to add a note saying “this is a vector<string> iterator” provided the typedef was in the provided sample.
Of course, while it is probably a superior method from a functional standpoint, it is significantly more complicated to implement. Which raises the question of whether the benefits are worth the extra outlay of effort required to produce the highlighter. My personal view is that pattern matching for the moment is the better option for things like articles, where we are confident that the input is a valid piece of code – and thus should be fine in a normal highlighter. For uses like pastebin/nopaste sites though, it would be beneficial to have this kind of extra information since they are often used for collaborative debugging, and so highlighting of syntax errors, and other possible errors like a definition of a used type not being available (this might not be a true error as the definition may be in a file not provided to the highlighter, but it could still be worth noting – and it would definitely be useful for self-contained testcases).
Language Nesting in Code Samples
The other limitation in many existing syntax highlighters is that they are not able to apply several different language highlighting schemes to one piece of provided input. This can be annoying when you’re highlighting things like web pages, which can easily contain HTML, with nested CSS (in <style></style>), nested JS (in <script></script>) and perhaps server-side languages like some PHP (in <?php ?>).
For the most part, just selecting one to highlight works, since it’s unlikely that more than one requires significant attention at once, however there are situations where it would be useful to have each block highlighted separately. However, this would either require the user to select ranges of code to highlight in different language engines, or it would require the highlighter to attempt to automatically determine what language segments of code are. The first is tedious for the end-user, and would likely lead to the product not being used, and the latter adds significant complication to the highlighter.
These things are something I would like to see included in the functionality of pastebin/nopaste websites, but due to the complexity I can’t expect them to just turn up one day. So, given that I figure I might give it a go, simply writing a fairly cut-down proof-of-concept to maybe appear with pastesite one day (in C++, not PHP since I expect the performance of PHP to not be capable of this satisfactorily). As to whether I’ll ever finish it, that remains to be seen, but I do think such a product would be beneficial to the programming community as a whole, and I hope if I don’t do it maybe someone else will.
Arbutus TC v1
January 7th, 2009
So, what have I been doing with my time recently? Bits and pieces of personal project tinkering, and also one small paid project. This project was a website for the Systems Engineering consulting company Arbutus Technical Consulting. It was recognised that they needed an effective web presence to help bring in business for the company, and I was hired to build that website to the specification provided.
The website was specified to be a very simple, mostly static collection of pages including an easy to use blog system for comments the company’s primary consultant had on issues related to Systems Engineering. I designed a simple interface and translated that into a working website which Arbutus can use to advertise themselves to potential clients.
If you’re interested then have a look at what I came up with for Arbutus Technical Consulting.
Setting an Agenda
November 30th, 2008
If you’ve checked up on the site since my last blog entry, you’ve probably noticed I have indeed started on the pages for the rest of the site. The about and contact pages are finished, and the footer now automatically displays the four most recent blog entries. Now comes the main bit of the work, writing a CMS to manage my current and completed projects.
The Main Site
So, what exactly is going to be one the main site? If you’ve seen the front page you can probably mostly guess. There will be project pages for each project I’m currently working on or have completed – and there will be one of each set as “featured” works displayed on the front page and in the blog footer (the other spot will be filled by the most recent project).
The project pages themselves will be written descriptions of the project: what it’s about, what it’s aiming to produce, what I want to learn from it, what’s being used to implement it. It will also have a section for relevant blog articles, which will be automatically fetched from here by selecting all the articles with a given tag (so for the zbot2 project, I will look for a “zbot” tag on blog articles).
This should provide a good source page to refer people to to answer any questions about the project, and can serve as a home for any projects I decide I would like to release publicly.
What About zbot?
I mentioned I was hoping to start zbot v2.0 soon, I will hopefully be starting that almost directly after finishing the main site on here. In the meantime it’s time to start some design for the structure of the program and its source files. When the project does get started I’ll make note here, and hopefully there’ll be a working core available before too long.
Other Work
This site and zbot aren’t the only things I’m doing however, there’s another project I would like to write which I have not really begun looking at yet, but which requires a fair bit of research before I can start. I’ll probably look into a few of the topics I’ll need to write it while I’m working on the other projects (this site and zbot), and will hopefully write a few articles on them to help cement my understanding and to share what I’ve found out.
So, hopefully we’ll see the rest of the main site taking shape over the next week or two. But if I don’t have time to blog for a little while, don’t think I’ve stopped working, hopefully it means quite the opposite, but we’ll have to see.
Personal Site Work
November 26th, 2008
So, as predicted there’s already been a pretty big gap, but that’s not to say I’ve been neglecting the blog. I’ve had a new bespoke design drafted up by a friend I know through a web development community, which I hope to get properly skinned for WP and set up as a personal site on alex-elliott.co.uk soon. The designer is Adam McPeake, who is linked in the blogroll under wized (wized.net is his portfolio).
Previews
So, while I get to work converting the design into a WP theme and writing a CMS for my personal site, here’s a preview of what it should eventually look like (the main site, and the blog):
Hope to have some more updates about this soon, or maybe you’ll be reading this in the lovely new theme.
UPDATE: as you may have noticed the new design has been skinned into WP (and I rediscovered precisely why I hate that theme system). Hopefully things still work, but if they don’t please do comment and let me know.
NOTE: yes, the rest of the site isn’t done yet, what’s up is an example of what it should look like when done (at least the index page), I’ll start on getting the main top links at least drafted into markup now – and I should be able to finish the about and contact pages completely within a few days.
Exciting stuff.
Search
Recent Articles
- Qt on Android with Necessitas
- New Lexing Parsers and What They Mean for Expression Editor
- Back to Work
- Expression Editor Update (2)
- Expression Editor Update
My Projects
- Expression EditorGraphical Regular Expression Editing Tool
- My Pastebin – PasteSite.ComMy Easy Usable NoPaste/Pastebin Application
Blogroll
- Binary VisionThe Technology Musings of Ed Cradock
- Tim Davies sans vowelsThe Blog of Tim Davies
Other Sites
- HideMyAss – Free Proxy and Privacy ToolsHide My Ass! Free Proxy and Privacy Tools – Surf The Web Anonymously
- Iam-andy – Design PortfolioThe Design Portfolio of Andy Mallaby
- Zymic – Free Webhosting and MoreFree Web Hosting, Free Templates, Free Tutorials and more! Zymic Webmaster Resources
Archives
- February 2012 (2)
- July 2010 (1)
- January 2010 (1)
- December 2009 (2)
- November 2009 (1)
- February 2009 (1)
- January 2009 (1)
- December 2008 (1)
- November 2008 (5)
Categories
- android (1)
- arbutus (1)
- c++ (6)
- general (4)
- musing (1)
- php (5)
- polymer (4)
- zbot (3)
- projects (13)
- expressioneditor (6)
- qt (6)
- regular expressions (4)
- web (5)