|   | php.net | support | documentation | report a bug | advanced search | search howto | statistics | random bug | login | 
| 
 PatchesPull RequestsHistoryAllCommentsChangesGit/SVN commits              [2013-11-19 19:40 UTC] adrien dot brault at gmail dot com
  [2013-11-19 19:46 UTC] adrien dot brault at gmail dot com
  [2014-02-28 17:59 UTC] crog at gustavus dot edu
  [2017-01-01 12:30 UTC] nikic@php.net
  [2017-01-07 23:07 UTC] php at laszlokorte dot de
  [2017-02-17 14:03 UTC] hwold at hwold dot net
  [2021-04-27 15:36 UTC] neclimdul at gmail dot com
  [2021-04-27 15:41 UTC] nikic@php.net
 
-Status: Open
+Status: Wont fix
  [2021-04-27 15:41 UTC] nikic@php.net
 | |||||||||||||||||||||||||||||||||||||
|  Copyright © 2001-2025 The PHP Group All rights reserved. | Last updated: Fri Oct 31 11:00:01 2025 UTC | 
Description: ------------ The current serialization format does not specify value ids and instead relies on a count from the beginning of the data. While this works fine for "standard" serialization, it breaks horribly when custom serialization (via Serializable) is thrown into the mix. In the case in which we first discovered this issue, we have an object which saves and loads its data to a file, database or whatever (depending on the current implementation of the save() method). It can then be saved/loaded via the save()/load() methods respectively, and this works fine. To ensure that we weren't wasting time/resources, we also implemented the Serializable interface and a serialize method which simply calls save() and returns a single string which can be used to identify the location of our data (a filename if stored on the file system, row id if in the database, etc.). Conversely, unserialize sets the source and calls load(). This, again, works fine if used in isolation. Now, where the problems arise is when the two are mixed. Our current file system implementation of save() does its own serialization. If we call save() to save our data, but then load through it through the unserialize operation, the serialized data's reference counts are all off by one (as the original serialization didn't go through serialize). The same off-by-one bug shows up if this is done in reverse (saved through serialize(), loaded through load()). The effects of this bug are one of two things: - The values simply point to the wrong variables. Very hard to debug and incredibly confusing to anyone unfamiliar with the serialization format. - The values point to non-existent values (-1 or max +1). This causes an error which reports the byte offset into the serialized data where the error occurred. The test below is rather convoluted for a test case, but demonstrates the problem as it relates to the code in which we discovered it. I can, if necessary, attempt to provide further explanation or additional details. Test script: --------------- class ObjectWithReferences { protected $var1; protected $var2; protected $var3; public function __construct() { $this->var1 = new StdClass(); $this->var2 = null; $this->var3 = $this->var1; } } class WrapperObject implements Serializable { static $tempstorage; protected $obj; public function setObject($obj) { $this->obj = $obj; } public function getObject() { if (is_string($this->obj)) { $this->obj = unserialize($this->obj); } return $this->obj; } public function save() { $archive = (object) [ 'object' => is_string($this->obj) ? $this->obj : serialize($this->obj), ]; static::$tempstorage = json_encode($archive); var_dump("Serialized:", static::$tempstorage); } public function load() { $archive = json_decode(static::$tempstorage); $this->obj = $archive->object; } public function serialize() { $this->save(false); return "wrapper!"; // Doesn't actually matter here. } public function unserialize($serialized) { $this->load(); } } $owr = new ObjectWithReferences(); $wrapper = new WrapperObject(); $wrapper->setObject($owr); $wrapper->save(); $wrapper->load(); var_dump($wrapper->getObject()); $serialized = serialize($wrapper); $wrapper = unserialize($serialized); var_dump($wrapper->getObject()); Expected result: ---------------- string 'Serialized:' (length=11) string '{"object":"O:20:\"ObjectWithReferences\":3:{s:7:\"\u0000*\u0000var1\";O:8:\"stdClass\":0:{}s:7:\"\u0000*\u0000var2\";N;s:7:\"\u0000*\u0000var3\";r:2;}"}' (length=152) object(ObjectWithReferences)[9] protected 'var1' => object(stdClass)[10] protected 'var2' => null protected 'var3' => object(stdClass)[10] string 'Serialized:' (length=11) string '{"object":"O:20:\"ObjectWithReferences\":3:{s:7:\"\u0000*\u0000var1\";O:8:\"stdClass\":0:{}s:7:\"\u0000*\u0000var2\";N;s:7:\"\u0000*\u0000var3\";r:3;}"}' (length=152) object(ObjectWithReferences)[8] protected 'var1' => object(stdClass)[9] protected 'var2' => null protected 'var3' => object(stdClass)[9] Actual result: -------------- string 'Serialized:' (length=11) string '{"object":"O:20:\"ObjectWithReferences\":3:{s:7:\"\u0000*\u0000var1\";O:8:\"stdClass\":0:{}s:7:\"\u0000*\u0000var2\";N;s:7:\"\u0000*\u0000var3\";r:2;}"}' (length=152) object(ObjectWithReferences)[9] protected 'var1' => object(stdClass)[10] protected 'var2' => null protected 'var3' => object(stdClass)[10] string 'Serialized:' (length=11) string '{"object":"O:20:\"ObjectWithReferences\":3:{s:7:\"\u0000*\u0000var1\";O:8:\"stdClass\":0:{}s:7:\"\u0000*\u0000var2\";N;s:7:\"\u0000*\u0000var3\";r:3;}"}' (length=152) object(ObjectWithReferences)[8] protected 'var1' => object(stdClass)[9] protected 'var2' => null protected 'var3' => null