clockwerx | 9 years ago | on: HTML 5.1 is a W3C Recommendation
clockwerx's comments
clockwerx | 9 years ago | on: Taking PHP Seriously
Theres a lot of raise ArgumentError.new unless foo.kind_of? Bar
Going on in my code these days.
clockwerx | 9 years ago | on: Not Just Any Old Geek
Go read about the history of the web or XML; you get a very strong sense that these things have been thought about for a long time - data interchange has varied formats, but there is still a lot of tedious ETL type work. The importance of naming things well hasnt changed. Identifiers for data being a hard thing hasnt changed. Schema/vocabularies and more are still important problems.
If you cant see some of these things underpinning much of the work we do, you might be missing the forest for the trees
clockwerx | 9 years ago | on: Hoaxes and scams on Facebook: How most of them work and spread
clockwerx | 9 years ago | on: Guido van Rossum: “PDF Must Die.” (2014)
PDF as a primary info exchange format online is stupid and needs to die
clockwerx | 9 years ago | on: Machine Learning and Ketosis
clockwerx | 9 years ago | on: Machine Learning and Ketosis
clockwerx | 9 years ago | on: How to Avoid Being Called a Bozo When Producing XML (2005)
Namespaces, which then gives you easy answers for Internationalisation (xml:lang), a subject-predicate-object data structure (RDF), which can lead on to logical meaning/modelling of data (RDFS/OWL), which then lets you look at harder questions like trust/provenance.
There's also schema validation (XSD), transformation (XSLT), which then provides you tools like XPath.
Most of that is on the front page for the technology: https://www.w3.org/standards/xml/
The real problem is not syntax, its communication between groups with differing experiences and interests - how do I know your messages mean the same thing as what my system expects?
If you prove to be malicious, do I have to write a strict validator before I trust your input?
If you want to ensure your messages are well formed before they are sent, do you also have to write a validator?
How do I know our validators are checking the same things?
If you want to send a large document oriented data structure, but I only care about a specific section relating to my interests; do I have to understand where to look and what all of the surrounding material is; or can I query for the relevant bits?
On the more complicated RDF side of things - if you want to share identifiers with me, how do we both avoid calling everything record id=1?
If we are both talking about the same thing but know different parts of the story, how can I recognize your information as describing the same thing I know about?
If we both know about the same Thing, and know certain logical facts about that Thing, can we check those facts actually make sense against shared rules?
If we both know about the same Thing, and can see a logical inconsistency in data, can we reason about which data to Trust and why?
Unfortunately, communicating properly is hard even with all of the tools to help.
We tend to opt towards subjecting systems to an ongoing fuzzing test because we don't value many of the above things - we tend to work in organisations with a short attention span focused on the now and a narrow set of interests. It just kind of works for the 80% of the time, so we move on.
Contrast that with something like a library or museum, and you see why ideas like Dublin Core really catch on there.
clockwerx | 9 years ago | on: Say_what: Using speech-to-text to fully check out during conference calls
clockwerx | 9 years ago | on: Thoughts on the sociology of Brexit
clockwerx | 9 years ago | on: DeepOSM: Detect roads and features in imagery with neural nets using OpenStreetMap
Would you be open to working with them to deliver the potential missing roads; so that you don't have to reimplement a lot of the basics?
clockwerx | 9 years ago | on: DeepOSM: Detect roads and features in imagery with neural nets using OpenStreetMap
For example: http://labs.strava.com/routing-errors/#250/17/-46.42546/-21.... gives a fairly strong indication that something is navigable on bicycle or foot; even if OSM hasn't mapped it yet.
clockwerx | 9 years ago | on: Do Police Body Cameras Really Work?
clockwerx | 10 years ago | on: Bots won't replace apps, only better apps will replace apps
Why not? Technical difficulty - my local pizza shop doesnt care about running a payment processing endpoint, structured data, etc. Its counter to their goals.
What we need is an open source point of sale system that is web first - oscommerce with a desktop UI kind of thing. Then it becomes easy to add in 'publish standard compliant data'
clockwerx | 10 years ago | on: Want to start a small data journalism team in your newsroom?
clockwerx | 10 years ago | on: How Software Gets Bloated: From Telephony to Bitcoin
clockwerx | 10 years ago | on: My Biggest Regret as a Programmer
clockwerx | 10 years ago | on: The smart home freak show stops here
clockwerx | 10 years ago | on: Yelp Fired a Single Mother Today: Me
clockwerx | 10 years ago | on: Heisenberg Developers (2014)
Specific example: RDFa vs microdata. One was well understood, the other made up on the spot. One played well with a huge amount of preceding standards, the other... meh.
While bleeding edge fashion brings great and new things to a culture, it also risks local maxima - a point where you cannot evolve further without negatives.
Examples: Blockbuster vs netflix, Rails new vs computer science; and dinosaurs vs mammals (you cant eat us all despite jurrassic park, you giant chickens!!)